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To Trina 


Preface 


This is the first of a two book series written for beginning graduate- 
level real analysis students and it is focused on essentials of set theo- 
ry, topology, and measure theory. The word “essentials” is often as- 
sociated with lecture notes, but this is exactly opposite to my inten- 
tion. While there are quite a few fine abstract analysis books, they 
are difficult to read, and an enthusiastic reader needs more time and 
effort to master the subject, as opposed to elementary level books 
that only in part address the needs of forthcoming mathematics and 
advanced engineering courses. 


That is why this book offers a thorough and yet rigorous treat- 
ment of key analysis subjects in abstract spaces by providing the 
reader with copious illustrations, examples, and exercises with se- 
lected solutions. Some difficult to understand topics are preceded by 
detailed discussions and blueprints, so that the reader will not get lost 
or intimidated in a long chain of proofs and notions. Furthermore, at 
the end of each section there is a summary of new terms and notation 
in the chronological order in which they appeared in the text. This 
should be helpful and relieve students of the potential overload of 
new words, definitions, and concepts. So, there is an increased likeli- 
hood of success by using this text in a course or by means of individ- 
ual self-study. 


My previous text, Real Analysis, published by Chapman and Hall 
in 2000, was the first effort to create this kind of book. However, this 
book only partially accomplished the goal I was striving to achieve. 
To fully realize that goal, it was necessary to write a new and ex- 
panded edition, including more topics and details, and it had to be 
produced as two books. The companion book, Advances in Abstract 
Analysis and Applications, includes further topics in topology and 
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measure theory, which justifies and rewards the reader for investing 
the time spent on “essentials.” 


As mathematical education has become increasingly more fo- 
cused on applications and less on theory, and in order to save them 
from extinction, academics have repurposed courses in set theory, to- 
pology, abstract algebra, and measure and integration as a real analy- 
sis course. At the same time, mathematical research, driven by seri- 
ous applications to other sciences, continued to require sound foun- 
dations. The pertinent precedents include physics, stochastic finance, 
mechanics, and now biology. There was even a time when some pro- 
ponents called real analysis “the single most important graduate 
course in mathematics to prepare for a career in operations research.” 


Today, real analysis is still very much alive, although it has un- 
dergone some significant modifications. One of these changes is that 
contemporary real analysis books include various, sometimes exotic, 
applications ranging from partial differentials equations to wavelet 
analysis, probability, and even physics. While such connections 
might be justified, one has to ensure that this propensity to connect 
analysis with remote disciplines does not relax its very substance. 
Consequently, facing a challenge of two alternatives to yield an over- 
sized or abridged book, a broad-spectrum project (under strong en- 
couragement from Springer) got bifurcated into two entirely differ- 
ently focused texts. 


Because real analysis, in its proper form, is likely to be the first 
abstract mathematics course that many students take, the associated 
topics should be taught in a strict order starting with basic set theory 
followed by point-set topology and then measure theory and integra- 
tion. Throughout my book I follow these principles. I strongly advo- 
cate the idea of introducing measure and integration in abstract 
spaces wasting no valuable time on Euclidean spaces. Consequently, 
Lebesgue measure and Lebesgue integral are reduced to mere illus- 
trations. The topology part, to be necessarily preceded by metric 
spaces, contains mostly fundamentals (such as bases, subbases, 
Hausdorff spaces, Tychonov product, and compactness). In partic- 
ular, old good sequential convergence is enough to proceed with rig- 
orous and comprehensive measure and integration (which accounts 
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to over 55% of my book). Such topics as filters and nets, locally 
compact Hausdorff spaces, Radon measures, and Hilbert spaces 
(measure-theoretic version) I consider relatively advanced and there- 
fore treat them in my forthcoming sequel to this book, along with 
various applications to stochastic analysis. 


It is absolutely impossible to produce a sound text without 
building on the foundations of my predecessors' important scholar- 
ship. I am very grateful for the valuable remarks and suggestions, 
made to the present and earlier edition, by Gustave Choquet, Jerald 
Folland, Jordan Stoyanov, Jiirgen Becker, Richard Syski, Jean 
Lasserre, Donald Konwinski, and Dean Spitzer. I am much indebted 
to Simon Smith, the creator of the EXP word processor, for his gen- 
erous and timely support. 


My deep appreciation also goes to Ms. Vaishali Damle, who is an 
extremely competent and efficient mathematics editor. Her constant 
help and patience was made available from the very beginning of the 
project and receiving her feedback was one of the key benefits of 
writing the book. Finally, I want to thank the Springer copyeditor, 
Ms. Valerie Greco, who did an excellent job in polishing the final 
draft of my manuscript. Because books are often written at home, I 
would be remiss in not ending with profound gratitude to my wife, 
Irina, who created an environment conducive to writing this book. 


J.H. Dshalalow 
Melbourne, Florida 
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Chapter 1 


Set-Theoretic and Algebraic 
Preliminaries 


Set theory is not just one of the main tools in mathematics, it is the 
very root of mathematics from which all mathematical disciplines 
stem. Due to a series of his papers, the first of which appeared in 
1874, the great German mathematician, Georg Ferdinand Cantor 
(1845 — 1918), is considered to be the sole founder of set theory . Al- 
though the Czech Bernard Bolzano (1781 — 1848) made one of the 
first attempts to formalize set theory, in particular in his Paradoxien 
des Unendlichen (of 1851), by considering the one-to-one correspon- 
dence between two sets (later on developed by Cantor to what we 
now know as cardinals), neither he, nor anyone else, was really a 
predecessor to Cantor's creation. Ernst Zermelo (1871 — 1953) was 
another German, who among his numerous contributions to set 
theory, is the author of the first axiom for set theory (of 1908) and 
undoubtedly one of the primary axioms of the whole mathematics. 


This chapter presents only essentials of set theory and abstract 
algebra needed throughout the book. 


1. SETS AND BASIC NOTATION 


Cantor defined a set as a collection M into a whole of definite and 
distinct objects (that are called elements of M) of our thought. In 
other words, we bind objects (perhaps of different nature) in our 
mind into a single entity and call that entity a set. 
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We denote sets by capital letters, and their elements by lower 
case letters. For instance, a set A has elements a, b,c, or a1, @,.... 
To abbreviate the expression “a is an element of the set A,” we write 
a € A. The expression “a ¢ A” reads “a is not an element of A.” 


Observe that the notion of a set is relatively simple if we deal 
with such frequently encountered sets as sets of integers, rational 
numbers, real numbers, or continuous functions. In some rare sit- 
uations, thoughtless use of this notion can lead to contradictions, as 
with Bertrand Russell's paradox. Russell posed the following set di- 
lemma. Let R be the set of all sets, which are not elements of 
themselves. Clearly, R is not empty. For instance, the set of all real 
numbers is not an element of itself (for it is not a real number), thus 
it belongs to R. The question arises: is R an element of itself? If 
R € R then by definition of 7, it should not belong to R, which is a 
contradiction. Thus, R ¢ R. But then, by definition, it must belong 
to R, which is impossible. In this case, we have put the definition of 
an object ahead of its existence. The concept of a set must be 
supported by axioms of set theory, just as the main axioms of plane 
geometry define the shape of lines. 


1.1 Definitions. 


(i) A set A is said to be a subset of a set B (in notation, A C B) 
if all elements of A are also elements of B. If A is a subset of B, we 
call B a superset of A (in notation, B > A). A set that contains 
exactly one element, say a, is called a singleton (set) and it is de- 
noted by {a}. If a € A, then we can alternatively write {a} C A. 
Any set is obviously a subset of itself: A C A. 


(it) The unique set with no elements is called the empty set and 
is denoted @. Clearly, @ is a subset of any set, including itself. 


(iti) A = B (read “set A equals set B’”’) if and only if AC B 
and B C A; otherwise, we will write A # B. Occasionally, we use 
the symbol “ Cc ” to appropriate the situation where one set is a sub- 
set of another set but the sets are not equal. A C B reads “A is a 
proper subset of B.” In this case, B is a proper superset of A (in 
notation, B > A). 
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We postulate the existence of a set that is a superset of all other 
sets in the framework of a certain mathematical model. This set is 
usually called a universal set or just universe. We also make use of 
the word “carrier” as a synonym for the universe and reserve for it 
the Greek letter Q. Sometimes, we denote it by X, Y, or Z. A 
universe (as a base for some mathematical model or problem) is ge- 
nerally defined to contain all considered sets and it varies from 
model to model. For example, if Cra a] denotes the set of all n-times 
differentiable functions on interval [a,b], it contains, as a subset, the 
set of possible solutions of an ordinary differential equation of the 
nth order. Thus, Q) = Ci.8] is a relevant universe within which the 
problem is posed. One could also take for 22 the set C;, 4) of all conti- 
nuous functions on [a,b] or even the set of all real-valued functions 
on [a,b]. However, these are “vast” to serve for universes and they 
are impractical for this concrete problem. 


Set theory is also a basic component of probability theory, which 
always begins with elements of set theory under a slightly modified 
lexicon. For instance, a universe is referred to as a sample space. 
Specially chosen (or measurable) subsets of the sample space are 
called events. In particular, measurable singletons are called elemen- 
tary events. 


The concept of the universe becomes most vivid when used in 
probability theory. Let us consider the experiment that consists of 
tossing a coin until the first appearance of the head on the upper face 
of the coin. Denoting H as an output of the head and 7’ as an output 
of the tail, when tossing the coin, we may define {(T,T,...,7, H)} 
as an elementary event of the sample space {2 filled with the 
elements {(H),(T,H), (7, 7,H),...}. The sample space Q 
contains, as elements, all possible outcomes of tossing the coin until 
the “first success” or the first appearance of the head. For instance, 
the event {(H), (T, H), (1,7, H)} corresponds to the “‘success in at 
most three tosses.” 


1.2 Notations. Throughout the whole book we use the following 
notation. 


(i) Logical symbols: 
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VY means “forall”. 

| means “there is” or “there are_lor “there exists _| 
= means “impliesior “from ... it follows that... 
> 

/\ 

V 


i= 


means “if'and only if 

means Lund |(also “&” on occasion). 
means Lor 
means “such that_|(used for definition of sets). 


(ii) Frequently used sets: 


N: _ the set of all positive integers. 

No: the set of all nonnegative integers. 

Z: the set of all integers. 

Q: the set of all rational numbers. 

Q°: the set of all irrational numbers. 

R: the set of all real numbers. 

C: _ the set of all complex numbers. 

R,: the set of all nonnegative real numbers. 
R_: the set of all negative real numbers. 


(iit) Denotation of sets: 


List: The elements are listed inside a pair of braces 
[for instance, {a, b,c} or {a1, ao,... }]. 

Condition: A description of the elements with a condition 
following a colon (that in this case reads “such 
that”), again with braces enclosing the set (for 
instance, The set of odd integers is {n € Z: 
n=2k+1,keZ}). 


(iv) Main set operations: 


Union. AUB={xEM cE AV ze Bh. 
Intersection. ANB={xeEN: ce AA xe Bh. 

Two subsets A, B C 2 are called disjoint if AN B = @. 
Difference: A\B={xE0: cE A A wx ¢ B} (A\B is 
also called the complement of B with respect to A, with the 
alternative notation A — B or B.) 
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Symmetric Difference: A A B = (A\B)U (B\A). 
Complement (with respect to the universe 2) : AC = Ag = 
Q\A. 

(v) General notation: 
“: =” reads “set by definition.” 

CL) means the end of a proof, remarks, examples, and so on. 


A set-algebraic expression is a set in the form of some defined 
sets connected through set operations. Any transformation of a set- 
algebraic expression into another expression would require a set-the- 
oretic manipulation we call a set-algebraic transformation. All basic 
set-algebraic transformations over basic set-algebraic expressions are 
known as laws of algebra (or calculus) of sets. O 


1.3 Remark. One of the standard tools of the algebra of sets is 
the so-called pick-a-point process applied to, say, showing that 
AC Bor A= B. It is based on the following. 


Axiom of Extent: For each set A and each set B, it is true that 
A = B if and only if for every « € Q, x € A when and only when 
ge B. 


Axiom's modification: 


If every element of A is an element of B, then A C B. 


Thus, for the modification, the pick-a-point process consists of 
selecting an arbitrary point x of A (picking a point x) and then 
proving that x also belongs to B. 

The identities below can be verified easily by the reader using 
pick-a-point techniques. O 


1.4 Theorem (Laws of Algebra of Sets). 


(2) Commutative Laws: 
AUB=BUA. 
ANB=BNA. 

(22) Associative Laws: 


(AUB)UC = AU(BUC). 
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(AN B)NC=AN(BNC). 
(iit) Distributive Laws: 
(AUB)NC = (ANC)U(BNC). 
(AN B)UC =(AUC)N(BUC). 
(iv) | Idempotence of 
complement: (A°)° = A 
union: AUA=A 
intersection: AN A= A. 
(v) AN AS =@. 
(vi) AUAS=9Q. 
(vii) De Morgan's Laws: 
(AU B)f = ASN BS. 
(ANB) =]Aeu Bs. 
(vit) AUO=A. 
Gz) ANGB=@. 
(x) Q° = @ and O° =. O 


1.5 Example. Show the validity of the first distributive law: 


zE(AUB)NC 
& «reE(AUB)ALrEC 
= [tEeAANZEC 
& tEANC)V([teBAxcEeC S&S rcEeBnc 
= ge (Ane) u(Bnic), O 


1.6 Remark. The concepts of union and intersection can be 
extended to an arbitrary family of sets. For instance, 


UA; ={# € 0:51 € 1, 2 € Aj}. 


iel 


The distributive laws and De Morgan's laws hold for arbitrary 
families [subject to Problem 1.1b)]: 
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(uas) nB= WAN) 


ieL el 


(n4\) UB=[)(A; UB) 


el iel 


(ua) = 945 


ove ie. 


(n4\) =U. Oo 


el ie. 


1.7 Definitions. 


(i) An indexed family F = {A; CQ: i € I} of sets is called 
(pairwise) disjoint, if for alli A j, A; NA; =@. 


Throughout this book, the union of a pairwise disjoint family of 
sets is denoted for convenience by }°,.,A;. Specifically, A+B 
means AU B, when A and B are disjoint. 


(ti) A decomposition of a set A is any representation of A as the 
union of a disjoint family of sets, A = )7,.,A;. The family {A;; 
i € I} is referred to as a partition of A. (There is another use of the 
term partition, applied to a different construction in a narrower 
sense. Namely, P is a partition of a closed interval [a,b] C Rif P is 
any ordered finite set of points {ao,...,an}C [a,b] with 
G=ag ep S22 ag Sb) 


(277) Let Q be a fixed set. The family of all subsets of ( is called 
the power set of Q and it is denoted by P(Q)). 


(iv) A sequence {A,:n=1,2,... } of sets is said to be 
monotone nondecreasing (nonincreasing), if 


A; C Ap C++» CA, C++: (Ay D Ag D+) D Ay D---). 


To specify the type of convergence, we write {A,} A ({A,}J| A). 
A sequence {A,,} of sets is said to be monotone vanishing, if it is 
monotone nonincreasing and {A,,}| 9. 
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(v) Let {A,,} be an arbitrary sequence of sets. Denote 


(a) lim inf A,, (or just lim A,,) = U al A, . This limit is 
called the limit inferior. 


(b) lim inf A, (or just lim A,) = () U Am. This limit 
n=l1m=n 


is called the /imit superior. 


If lim A, =lim A,,, then we denote their common limit by 
lim; 00 An. In this case, the limit of (the sequence) {A,,} is said to 
exist and equal lim,—..5 An. O 


PROBLEMS 


1.1. a) Prove Theorem 1.4, the laws of algebra of sets by using the 
pick-a-point process. 
b) Prove the generalized distributive laws and De Morgan's 
laws stated in Remark 1.6. 


1.2. Show that: 


a) (AUB)\C = (A\C)U(B\C). 
b) (AN B)\C = (A\C)N(B\C). 
c) C\(AUB) = (C\A)N(C\B) 
d) C\(AN B) = (C\A)U(C\B) 
e) (A\B) NC = AN(C\B). 


1.3 Show that A\B = AN B°. 


1.4 Let |A| =n (ie., the set A contains n elements). Show that 
|P(A)| = 2”. 


1.5 Prove that: 


a) (A\B)° = ACUB. 
b) [((A°U B)°U (AU B®)]° = B\A. 
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1.6 


1,7 


1.8 


1.9 


1.10 


c) (AN B)U(AN BUCA NB)=AUB. 


For each of the following, justify with a proof or give a count- 
erexample. 


a)\AUC=BUC = A=B, 
b) (AU B)\B=A. 
gjQA\p=C\B =} A=C. 
d) A°\ Be = B\A. 


Give an example of a monotone vanishing sequence of sets. 


Let {A,: n=1,2,... } be an arbitrary sequence of sets. 
Define Ap = () Ay and Ax = U An. 


n=1 n=1 
a) Construct a monotone nonincreasing sequence of sets 
{B,,} such that {B,,} | Ao. 


b) Construct a monotone nondecreasing sequence of sets 
{C;,} such that {C;,} 7 Aco. 


c) Given {C,}7 Ax, construct a pairwise disjoint 
sequence {D,,} such that °° ,D, = An. 


n=1 


Let {A,: n=1,2,...} be an arbitrary sequence of sets. 
Define Aj = () A, and Ax = LU Ap. Prove that 


n=1 n=1 
Ao = lim.A,, Cc limA, = Ags 


Let Q be an arbitrary set. Find a sequence { F,,} of subsets of 2 
such that 


lim E, = @ and lim E, = 2. 
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NEW TERMS: 


Cantor, Georg, 3 
Bolzano, Bernard, 3 
Zermelo, Ernst, 3 
set, 3 
element of a set, 3 
Russell's paradox, 4 
subset, 4 
superset, 4 
singleton, 4 
empty set, 4 
proper subset, 4 
proper superset, 4 
universe, 5 
carrier, 5 
sample space, 5 
measurable subset, 5 
event, 5 
elementary event, 5 
Logical Symbols, 6 

Vv means “for all” 

J means “there is” or “there are” or “there exists” 

=> means “implies” or “from ... it follows that ...” 

<> means “if and only if” 

/\ means “and” 

V means “or” 
means “such that” (used for definition of sets) 


N: _ the set of all positive integers 

No: _ the set of all nonnegative integers 

Z: the set of all integers 

Q: _ the set of all rational numbers 

Q*: the set of all irrational numbers 

R: the set of all real numbers 

C: the set of all complex numbers 

R: the set of all nonnegative real numbers 
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R_: the set of all negative real numbers 
union, 6 

intersection, 6 

disjoint sets, 6 

difference, 6 

complement of set B with respect to set A, 6 
symmetric difference, 7 

complement, 7 

set-algebraic expression, 7 

set-algebraic transformation, 7 

laws of algebra of sets, 7 

pick-a-point process, 7 

axiom of extent, 7 

commutative laws of sets, 7 

associative laws of sets, 7 

distributive laws of sets, 8 

idempotence, 8 

idempotence of complement, 8 
idempotence of union, 8 

idempotence of intersection, 8 

De Morgan's laws, 8 

pairwise disjoint sets, 9 

disjoint family of sets, 9 

decomposition of a set, 9 

partition of a set, 9 

partition of an interval, 9 

power set, 9 

monotone nondecreasing sequence of sets, 9 
monotone nonincreasing sequence of sets, 9 
monotone vanishing sequence of sets, 9 
limit inferior of a sequence of sets, 10 

limit superior of a sequence of sets, 10 

limit of a sequence, 10 
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The word “function” was introduced by Gottfried von Leibnitz in 
1694, initially as a term to denote any quantity related to a curve, 
such as its slope, the radius of curvature, and so on. The notion of the 
function was refined subsequently by Johann Bernoulli, Leonard 
Euler, Joseph Fourier, and finally, by Gustav Dirichlet in the middle 
of the nineteenth century with a formulation pretty close to what we 
are using at the present time and which a mathematics or engineering 
student meets in an introductory calculus course. Dirichlet 
introduced a variable, as a symbol that represents a set of numbers; 
if two variables x and y are so related that whenever x takes on a 
value, there is a value y assigned to x by some rule of cor- 
respondence. In this case y (a dependent variable) was said to be a 
function of x (an independent variable). 

In this section we introduce a more contemporary notion of a 
function. For functions operating with sets (rather than with points), 
we use a nontraditional notation of f, and f* (instead of just f), 
which we found very appealing as it brings more order within func- 
tions acting on collections of sets (such as topologies and sigma-al- 
gebras) and simplifies many proofs. 


2.1 Definitions. 


(1) Let X and Y be two sets. The set {(z,y): rE X, ye Y} 
of all ordered pairs of elements of X and Y is called the Cartesian or 
direct product of X and Y and it is denoted by X x Y. If X = Y 
then we write X x X = X?. Similarly, the Cartesian product of n 
sets, 


Ay he ® Xy Hl 5a) t Wr SE Ay .+5 On © Xa}, 


is the set of all ordered n-tuples. 
(ii) Any subset f of X x Y is called a binary relation. 


(iit) A binary relation f C X x Y is called a single-valued 
function (or just a function) if whenever (x,y,) and (x,y) are 
elements of f, then y; = y2. We also say that the function f is a 
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map (or mapping or transformation) from X to Y and denote this 
most frequently by the triple [X,Y,f] or by f:X — Y or by 
(x, f(x)) or by y = f(x) orbyx > f(a). 

Note that, although both X and Y are components of triple 
[X,Y,f], some pairs (2,y) from X x Y need not belong to f 
whether f is a function or a binary relation. 


(iv) For a function f (as a subset of X x Y), denote 
Dp= {ae X:@,y) SF} 


and call it the domain of f. When a function [X,Y, f] is given, its 
domain is generally not specified. The latter requires a more rigorous 
motivation. For instance, let X = Y =R and f(x) = V2? —-1. 
Then, Dr = (—00, — 1] U [1, 00) and for any x € ( — 1,1), there is 
no mate from Y under f. We can say that f is defined only for 
zeED fe 


In another example, let f(7) = Te This function is defined for 


all x € (1,00) if X = Y = R. However, if Y is the extended real 
line R = RU {+00, —oo}, we allow x € [1, ov]. And finally, it is 
not wrong to have x be any real (or even complex) number, if f will 
take on values in Y C C (or C = CU foo}). 


(v) Another component of a function is its range, 
Re={yeY: dre Dy, f(z) = y}. 


A superset of Ry (such as Y) is referred to as a codomain. In other 
words, f; is the subset of all such elements of Y, which take part in 
the relation f C Dy x Y. 


(vi) If x € Dy, then f(x) (€ Re) is called the image of x under 
f. By the above definition of a single-valued function, for every x, 
there is a unique image f(x). (An “extended” concept of a function 
allows more than one image of each point x under f. Any such 
function f is called multivalued. The reader is definitely acquainted 
with principles of complex analysis where such functions are 
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common. It is also known that in this case the range of a multi- 
valued function can be partitioned into pairwise disjoint subsets, 
such that the function is then split into a number of single-valued 
functions called branches.) 


Note that in the primary definition [X, Y, f] of a function most 
often Y is a codomain and X is a domain of f. However, in some 
cases, X is not a domain of f, which means that for some x €_X, 
there is no mate y € Y such that («,y) € f. In the latter case we 
call [X,Y , f] inconsistent. For instance, in the above example, the 


function IR, R, Vx? — 1| is inconsistent, because there are no mates 


y € Y for all x € (—1,1). If we try to fix this problem by a mini- 
mal invasion, namely assigning the empty set @ to any such point 
(i.e., setting f(a) = O), we are up against an inconsistency, for f(x) 
is suppose to be a point, not a set. The following construction of f.. 
better deals with this predicament. A function [X, Y, f] is consistent 
it X = Dy»: 


(vit) If A C Dy then the set of the images of all points of A un- 
der f is called the image of A under f and, following the notation of 
most analysis textbooks, it can be denoted 


f(A) ={y eY¥: dae A, f(x) = y}. 


However, for the upcoming constructions, it is convenient to distin- 
guish images of points from images of subsets of X under f. In other 
words, we introduce the function [P(X),P(Y), f.], where for 
A € P(X) we denote 


f(A) ={ye Y: dre A, f(x) =y} 


Specifically, Ry = f.(D,). In reference to inconsistency in (vi), we 
agree to set f,({z}) =@ Va ¢ Dyor more generally, f,(A) = @ 
whenever A ¢ P(Dy). With this add-on, [P(X), P(Y), f.] becomes 
consistent. The latter alone makes f,, superior to f. 


In some forthcoming applications, we also use the notion of func- 
tion f,, defined on P(P(X)) and with P(P(Y )) as its codomain. 
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(vitt) Let [X,Y,f] be a function. In its “ideal” construction 
call [P( Ry), PDs), f*] the inverse of f,.. In other words, for each 
set Be P(Ry,), the inverse image f*(B)={x eX: f(x) € B}. 
The set f*(B) is called the inverse image of B under f, or the pre- 
image of B under f. 


In a more general setting we define the inverse of f, as 
[P(Y), P(X), f*] bearing in mind that f*(B) = Gif B ¢ P(Ry,). 


Another construction related to f* and often used in analysis is 
f-' defined as {(y,z) € Y x X : (a, y) € f} and called the inverse 
of f. Unlike f*, f~+ need not be a single-valued function (in other 
words, it is a binary relation or multivalued munetion) Consider, for 
instance, the function [R, R, tI) such that f(x) = x*. Clearly, Ry 
= R, and the inverse if = f-! of f is a two-valued furiction with 


domain D; =R, and with range equal R, which can be 
decomposed as R = (—oo, 0) + [0, 00). Accordingly, we have two 
branches [IRi, (—co, 0), J jand[R,, Ri, Jf lot /-. 


In light of the function [P(P(X)),P(P(Y)), fax] we likewise 
consider its inverse [P(P(Ry)), P(P(X)), f™]. 


(72) Observe that it is legitimate for a single-valued function to 
have f(x,) = f(a) and x, # xy. However, if f is such that f (21) 
= f (x2) if and only if x; = x2, then f is called one-to-one or injec- 
tive or injection. If f is injective, f~' is a single-valued function too. 
For example, the map [P(R;), P(D,), f*] is injective. (See Problem 
2.8.) 


Because f~! in general is not a single-valued function we agree 
to regard f~'(y) as a set (which in particular can be a singleton or 
the empty set), with the alternative notation f*({y}). 


(x) Let LX,Y,f] be a function. Generally, f.(X) = Rp CY. 
In this case, we say the map f is from X into Y. When f,(X)=Y, 
we say the map f is from X onto Y or surjective or surjection. We 
call f bijective or bijection if f is both injective (one-to-one) and 
surjective (onto). 
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(xi) Let f CX x Y and g CY x Z be two binary relations. 
Then the composition (or composite) of f with g is defined as 


gof ={(x,z) © X x Z: Ay: (x,y) € f, (y,z) € g}- 
The composition of f with g is most frequently used when [X, Y, f'] 


and [Y, Z, g] are two functions and with Ry ND, £ O. 


(xii) A consistent function [X,Y,J], with X CY, is called a 
unity or identity function if for alla € X, I(x) = x. O 


2.2 Example. For a fixed subset A C X, define the indicator 
function as a consistent map [X, R, 14] with 


1, zEA 
1(a) = { 4 re AS. 


Then, [X,R,1,] is an into map, and the map [X, {0, 1}, 1,4] is sur- 
jective. O 


2.3 Definition. Let f: X — Y and let A C X. Then define 


Resaf = {(z,y) € (Ax Y)N ff}. 


This function is called the restriction of f to A. On the other hand, 
the function f is called an extension of the function Res 4 f. 


In general, under a consistent extension of [X,Y,f] we 
understand an expansion of X to a superset X’ and some reasonable 
definition of f on X’\X. We come across this problem in Chapter 5 
regarding an extension of a set function. O 


2.4 Example. Consider [IR, [ — 1,1], sin] which is surjective 
(onto) but not injective (one-to-one). Take a restriction of function 
[R, [—1,1], sin] to one of the largest subsets A of R where 
[IR, [ — 1, 1], sin] is monotone increasing. It is plausible to set A = 
[ 


aT 7 


— 5,4], because this set is also symmetric about the Y-axis. Then 


[A, [—1, 1], Res,sin] is obviously bijective and its inverse is the 
well-known function [[ — 1, 1], A, arcsin]. O 
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2.5 Remark. Let [X,Y, f] be a single-valued function such that 
for some y € Ry, f*({y}) = {x1, v2, v3} C X. Consider the com- 
position f, o f* and find that 


feo f (yh) = fai, v2, 03}) = ty}. 


Thus, if f is single-valued, the restriction of fo f~' to Ry is the 
identity function with the domain Dy, 7-1 = Ry. However, f he F 
need not be a single-valued function at all (show it). f-'o f is the 
identity function only when f is injective. O 


PROBLEMS 
2.1 Find the image of [—3, 5) under 1(; 9. 
2.2 Find the inverse image of (5, 4] under 1(; 3. 
2.3 Composition: 
a) Show that the composition operation © is associative. 
b) Show that (go f)-! = f-tog™t. 
c) Show that Dgor = Dy M f*(Dy). 
2.4 Show the equivalence of the following statements: 
a) f is injective. 
b) f(AN B) = f(A) f(B). 


c) For every pair A and B, of disjoint sets, f,(A) 9 f.(B) 
= @. 


What can go wrong with b) if f is not injective? 


In the following problems we assume that f is a map from X = Dy 
into Y. 
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2.5 


2.6 


2.7 


2.8 


2.9 
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Show that AC X = AC f*(f,(A)). 
Show that VB CY, f.(f*(B)) CB. 


Show that [X, Y, f] is surjective if and only if 
f.(f*(B)) = B, VBCY. 


Show that the map [P(Ry), P(Dy), f*] is injective. 


Is the map [P(D;), P(Ry), fx] injective? 


2.10 Is the map [P(Ry), P(Ds), f*] surjective? 
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3. SET OPERATIONS UNDER MAPS 


The most remarkable property of the inverse of a function is that it 
“preserves” all set operations (see Problem 3.6). The function itself, 
as we show, does not. The main theorems in this section are proved 
for special cases of surjective maps; the rest are left for the reader. 
Also, without loss of generality, consistency is assumed throughout. 


3.1 Theorem. Let [X,Y , f] be a surjective map and let B CY. 

Then 
[P*(B)P = f(B). 

Proof. We prove an equivalent statement, f*(B)+ f*(B°) = 
X, that is, we show that 

(i) f*(B) and f*(B°) are disjoint and (ii) f*(B) complements 
f*(B°) up to X. 

We start with 

(i) Suppose f*(B) and f*(B°) have a common point x. Then 
there is y; € B such that f(x)=y, and y2 € Be such that 


f(x) = yo. Thus, y, # ye and f is not a single-valued function. (See 
Figure 3.1.) 


Oh 


Figure 3.1 
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(it) If f*(B) does not complement f*(B°) up to X, there will be 
at least one point x that does not belong to either of these sets (for 
they are disjoint as shown above). This is an obvious contradiction, 


because it follows that f(a) ¢ Y. (See Figure 3.2 below.) O 
f(x) 
xX. © 
BS 
Y 
xX 
Figure 3.2 


3.2 Example. Let [X,Y,f] be a function. Then [f*(Y)]° = 
X° =@. On the other hand, setting B= Y, by Problem 3.1, we 
obtain 


PY") = f°); 


that is, f*(O) = @. O 


3.3. Theorem. Let [X,Y,f] be a surjective map. Then 
B, C By CY implies that f*(B,) C f*(Bo). 


Proof. Suppose that f*(B,) is not a subset of f*(B2). This 
implies the existence of a point « that belongs to f*(B,) and does 
not belong to f*(B2). Therefore, there is exactly one point y € B, 
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with f(a) = y. On the other hand, because x ¢ f*(B2), f(x) cannot 
belong to By). But it must, because f(x) = y € B, C By. (See 
Figure 3.3 below.) Hence, our assumption above was wrong. O 


Figure 3.3 


3.4 Theorem. Let f: X — Y be an onto map and let {B;: 
i € I} bean indexed family of subsets of Y. Then, 


(UB) = Ur@o. 


ier ie 
Proof. 
(i) We prove that L) f*(B;) C f* ( U Bi). 
tel tel 


Let x € LU f*(B;). Then there is an index t9 € J such that 
iel 
De 77 (B,,). Because. 5; C | )B;, by Theorem 3.3, f*(8,,) c 
wel 


io 


wel 


si ( U Bi) , which implies that x € f* ( ie Bi) 
ier 


(ii) We show the validity of the inverse inclusion, 
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(Ua) CUD, 


iel ie. 


Let x € f* ( UB). Then f(x) € JB;. Therefore, there is an 
iel iel 

index ig € J such that f(x) € B;, if and only if {f(x)} C Bj,. By 

Theorem 3.3, it follows that f*{f(ax)} C f*(B;,). Because x € 


f* ({f(x)}), we have 


{x} © f*{F(e)} © f*(Ba) © U (Bi). o 


PROBLEMS 


3.1 Prove Theorem 3.1 under the condition that f is an into map. 
Let |X,Y, f] be an into map and let B C Y. Then 


[f*(B))" = f*(B*). 


3.2 Prove Theorem 3.3 under the condition that f is an into map: 
Let [X,Y , f] be an into map. Then B, C By C Y implies that 
f*(Bi) & f*(B2). 


3.3 Generalize Theorem 3.4 when f is an into map: Let f: X — 


Y be an into map and let {B;: 7 € I} be an indexed family of 
subsets of Y. Then, 


r(ya) = ys 


3.4 Let [X,Y,f] be an into map and let {B;:i¢€J} be an 
indexed family of subsets of Y. 


a) Prove that f* ( () Bi) =[\7"(B. 


iel ie. 
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3.5 


3.6 


au] 


3.8 


b) If By, and By are disjoint, show that their inverse images 
under f are also disjoint. 


c) If {B;: i € I} is a pairwise disjoint family, show that 
f* (ier Bi) — Diert (Bi). 
Show that f*(A\B) = f*(A)\f*(B). 


The results above prove that all set operations are closed under 
the inverses of maps. Show that not all set operations are 
closed under maps as per the following. 


a) Show that maps preserve inclusions. 


b) Show that maps preserve unions. 


c) Show that maps do not preserve intersections; specifical- 
ly, show that 


and that the inverse inclusion need not hold. Explain the 
latter with and without a counterexample. 


d) Do maps preserve the difference? 


Let [X, Y, f] be a map and let A C Y. Show that 
1, (e) Tt) = 1 p-(4)(2). 


Prove the following properties of the indicator function defined 
on a nonempty set (2. 

(i) Lanps = min{1,, 1p} = Llp. 

(iz) l4aug = max{1y, 1p} =1,+ 1p - Lapp. 

(itt) layp =144+ 1p. 


(iv) Dy = icra: 
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3.9 


3.10 


3.11 


3.12 


3.13 
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(v) Ige =1—1y. 
(vi) ACB > 14<\1p. 


(vii) Iya, = sup{la: @€ I}, Ip a, = inf{14: 7 € Th. 
ve ve 


Let {A,,} be a sequence of subsets of 2. Show that the 
function lim1 4, is the indicator function of the set lim A, (.e., 
I~ oo 4 ) and that the function lim 14, is the indicator function 


m 
n=1lm=n 


of the set lim A), (i.e., Ix os i 


m 
n=l1m=n 


Prove that, lim | A, exists if and only if lim. 14, exists. 


Let LX, X’, F’] be a bijective map and let 7 and 7’ be respec- 
tive collections of subsets of X and X’ such that 


PCr VC? and Lair) Ge. 
[See Definition 2.1 (vii) and (viii) ]. Show that 
Pr) =rand Fr) =. 


Let [X,Y, f] be a consistent map. Denote F' = f, o f*. Then, 
the map [P(Y), P(Y), F'] is such that F’ = I, (J is the identity 
map) if and only if [X, Y, f] is surjective. 


Let [X, Y, f] be a surjective map. Show that [P(X), P(Y), fi] 
is surjective. 
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4. RELATIONS AND WELL-ORDERING PRINCIPLE 


In Definition 2.1(iz) we introduced the concept of a binary relation R 
as an arbitrary subset of A x B. In the special case when R C A x B 
and A = B, we call R a binary relation on A. We sometimes use as 
notation aRb instead of (a,b) € R. This notation makes sense, for 
instance, if R is stipulated by < or < on some set. In addition, we 
also say that a pair (X,R) is a binary relation, where in fact R is a 
binary relation on a set X (a carrier). Now we consider some special 
relations. 


4.1 Definitions. Let R be a binary relation on X. 
(i) Ris called reflexive ifVa € X, (a,a) ER [aRal. 


(iz) R is called symmetric if (a,b)€R => (b,a) ER [aRb 
= bRal. 


(ii) R is called antisymmetric if (a,b), (b,a) ER > a=b 
[aRb A bRa > a=O)). 


(iv) R is called transitive if (a,b), (b,c) ER => (a,c) ER 
[aRb A bRc => aRcl. 


(v) Ris called an equivalence on X (denoted by symbol * or 
E) if it is reflexive, symmetric, and transitive. 


(Observe that the equivalence E on X partitions X into mutually 
disjoint subsets, called equivalence classes. A partition of X is a 
family of disjoint subsets of X whose union is a decomposition of X. 
The elements of X “communicate” only within these classes. In sum- 
mary, every equivalence relation generates mutually disjoint classes. 
The converse is also true: an arbitrary partition of the carrier X gene- 
rates an equivalence relation. ) 


(vi) R is called a partial order (denoted by the symbol ~ ) if it 
is reflexive, antisymmetric, and transitive. 


(vii) If < is a partial order, it is called /inear or total if every 
two elements of X are related or comparable; that is, Va,b © X 
eithera <x borb~x<a. 
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(viiz) Let X be an arbitrary set and let E (=) be an 
equivalence relation on X. For t € X denote 


[tle (= [2] ~) = {y € X: yEx} 


and call it an equivalence class modulo E ( = ). The set of all equiva- 
lence classes 


XIE = X\x (or {[x]z}) 


is said to be the quotient (or factor) set of X modulo E (or & ). It is 
easily seen that a quotient set of X is also a partition of X. 

Note that z ++ [x]p is a function assigning to each x € X, an 
equivalence class [x]. We denote this function most often by 7g (or 
sometimes 7~) and call it the projection of X on its quotient by E (or 

~ ). In other words, for each x € X, 7(x) = [z],. CO 


4.2 Examples. 


(i) (IR, =) is an equivalence relation. Therefore, every real 
number as a singleton represents an equivalence class. 


(it) (IR, < ) isa linear order. 


(iit) Congruent triangles on a plane offer an equivalence relation 
on the set of all triangles. [Two sets A and B are called congruent if 
there exists an “isometric” bijective map f: A — B; ie., f must 
preserve the distance for every pair of points a,b € A and their 
images f(a), f(b) € B| 

(iv) (R?, <) is not a linear order if we define “<” as 
(a1, 61) < (do, by) if and only if ay <a A 6b, < by. To make this 
relation a linear order we can define, for instance, (a1, 61) < (aa, be) 
if and only if ||(a1, 61)|| < ||(@2, b2)||, where ||(a, b)|| is Euclidean 
distance of point (a, b) from the origin. 


(v) Let | be the relation on N such that n | m if and only if n 
divides m (without a remainder). It can be shown that (N,| ) is a 
partial order but not a linear order. (See Problem 4.5.) 
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(vi) Let p be a fixed integer greater than or equal to 2. Two 
integers a and b are called congruent modulo p if a — b is divisible 
by p (without remainder); in notation we write p|a—b ora=b 
(mod p). The number p is called the modulus of congruence. Define 


[m], = {n € Z: m =n (mod p)} (m € Z). 
In other words, 
[m]p ={n € Z: Ik EZ: n=kp+ mb}. 


Then any two integers m and n are related in terms of [- ],, if and 
only if n € [m],. This is an equivalence relation. (Show it; see 
Problem 4.1.) 


(viz) Let X be a nonempty set and RC X x X be a binary 
relation. Taking for R the diagonal D = {(x,x):x € X} we have 
with (X, D) the “smallest” (by the contents of elements of X x X) 
equivalence relation on X, where each element forms a singleton- 
class, and D partitions X into the “number” of classes corresponding 
to the quantity of all elements of X. The “largest” equivalence 
relation on X is obviously R= X x X itself and it consists of the 
single class. 


(viii) Any consistent function [X,Y, f ] (ie., a function with 
X = Dy) generates an equivalence relation on its domain X 
partitioning X into disjoint subsets. Define the binary relation Er 
(& ¢) on X as 


rE peg => f(@1) = f (2). 


Then, it is readily seen that Ey is an equivalence relation on X, 
referred to as the equivalence kernel of function f. Formally, for 
every point y © Ry = f,(X), the preimage f*({y}) is an equiva- 
lence class in X and X|Er = {f*({y}):y € Ry} is the quotient set 
of X modulo Ey. Furthermore, 5° f*C{y}) is a decomposition of 


xe 


yeRy 
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For instance, the function f(x) = x? generates a partition of R 
into a collection of subsets of the form { —a,a}, for a > 0, along 
with {0}, which is a factor (quotient) set of R modulo E,». 


Another example is the function 


X=R\j 2) ne ZR tanl. 
x=R\{ JR, tan] 


Let 
A, = tan*({y}) = {arctan y + m:n € Z} = [arctan Y Bien 


Then, Etan is the equivalence kernel of the function tan and 


X [Eun = {tan*({y}):y € R} 


is the quotient set of X modulo E;,,. Furthermore, 


X = Ver Ay: O 


The last discussion about an equivalence relation generated by a 
function yields some important results and notions we use in the 
forthcoming materials of Chapter 6. 


We demonstrated in Example 4.2(viiz) that any function on X 
generates an equivalence relation. However, the following proposi- 
tion states that the converse is also true, namely that any equivalence 
relation E is the equivalence kernel of some function, as per Propo- 
sition 4.3. 


4.3 Proposition. Let E be an equivalence relation on a nonempty 
set X. Then the projection map [X,X|E,7] (defined as 
M(x) = [x],) is a surjective map with E being the equivalence 
kernel of Tg. O 


Proof. From the definition of 7 it follows that 7g is surjective. 
To claim that E is the equivalence kernel of 7g, we need to show that 
Te(21) = TE(X2) if and only if 7; Ex. 


Let mp(x1) = 75(22). Because xEx, x € [x]p and thus, by the 
assumption 7_(21) = 7p(%2), 21 € [X2[p. This proves that 7, Ex». 
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Now let 2Ex3. If x € [xi]p, then x2Ex, and thus, by tran- 
sitivity, x 2Ex3; that is, x2 € [x3]p. Therefore, [21]p C [x3]z. The 
inverse inclusion, and thus the equality, is due to the symmetry of E. 
Hence, Te (21) = TE(x3). O 


Proposition 4.3 asserts that the projection 7g is an example of a 
surjective function defined on X and with the range X|E. 


Now suppose E is an equivalence relation on a set X and 
[X,Y,f] is any function whose equivalence kernel is E. The 
following theorem claims that there is a unique “intercessor” f 
between the quotient set X|E and the codomain Y of f. 


4.4 Theorem. Let [X,Y, f] be a consistent function with Er 
being its equivalence kernel. Then, there is a unique function 
[X|Es, Y, f] such that f = f o 7,. O 


The reader shall be able to take care of this theorem (Problem 
4.10) as well as of Corollary 4.5 (Problems 4.11 and 4.12). 


We call f the reducer of f. 


4.5 Corollary. Let [X,Y, f] be a function and let Ey denote its 
equivalence kernel. Then: (a) there is a unique reducer function 
[X|Er, Y, f] of f such that f is injective and f can be represented 
as a composition f = f om, of [X, X|Ef, TE,] and [X|Ez, Y, f]; 
(b) if f is surjective, the reducer f is bijective. O 

4.6 Example. To illustrate Theorem 4.4 and Corollary 4.5 


consider [R, | — 1, 1], cos]. If Ecos denotes the equivalence kernel of 
the cosine function, then 


R[Ecos = {cos*{y}: y € [— 1, 1]} 


is the quotient set of X = R modulo Egos such that 
cos*{y} = { tarccosy + 2mm: ne Z} 


and 
TH, (£)= [t] E06 ={+274+2m:neEZ} 
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= cos* {cosa} 


making [IR, R|Ecos, 75,,,| the projection of X = R on its quotient by 
Ecos. The reducer f of f is denoted by [IR|Ecos, | — 1, 1], cos] and by 
Corollary 4.5, it is bijective. Also, by Corollary 4.5, 


COS 0 7g,,.(@) = cos( + x+2mn:n € Z) =cosz. O 


Now we turn to a discussion on the partial order relation and all 
relevant notions and theorems, which we apply throughout the rest of 
the book. 


4.7 Definitions. Let (A, = ) be a partial order and let B C A. 
Clearly, (B, < ) 1s also a partial order. 


(i) The partial order (B, ~ ) is called a chain in (A, X ) if it 
is linear. 


(it) An element bo € B is called a minimal element of B 
(relative to =< ) if for each b € B with b = by, b = bo (compared 
with the smallest element by, whichis ~ b for all b € B). 


(iit) An element b,, € B is called a maximal element of B 
(relative to =<), if for each b € B, with by ~X b, it holds true that 
b=bx (compared with the largest element b.,, which is such that 
b < ba Vb E€ B). 


(Observe that the difference between a minimal element and the 
smallest element of a set is as follows. A minimal element bo is 
<b € B whenever bp is comparable with some b. In addition, the 
smallest element is comparable with all elements of B.) 


(iv) An element u € A is said to be an upper bound of B if 
bx ub € B. Anelement / € A is said to be a lower bound of B if 
|< b Vb € B. If B has lower and upper bounds then B is called 
bounded (or x -bounded). 


(v) Ifthe set of upper bounds of B has a smallest element up 
then this element is called the /east upper bound of set B [abbrevi- 
ated lub(B)] or supremum [sup(B)]. Similarly, if the set of all lower 
bounds has a largest element /., then it is called the greatest lower 
bound of the set B [in notation glb(B)] or infimum [inf(B)]. 
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[For instance, 0 is the glb((0, 1)) or inf(0,1) in (R, < ), whereas 
an lub of the set [1, \/2] MQ does not exist in (Q, < ).] 


(vi) Let B contain at least two points. The partial order (B, ~ ) 
is called a /attice if the supremum and infimum of every two-element 
subset of B are also elements of B. In notation: if A = {x,y}, then 


xVy: =sup{x,y} = sup(A) € B 
and 


xAy: = inf{x, y} = inf(A) € B. 


The symbols V and A are referred to as join and meet lattice opera- 
tions, respectively. O 


4.8 Examples. 


(a) Let = 41,3537 ney 3") oc fe Then OB, |) [where | ais 
the relation in Example 4.2(v)] is a chain in (N, | ). 


(i7) Let B = {2,3,4,...} and consider the relation | on B. In 
terms of this relation, the set of all prime numbers {2, 3,5,7,11,...} 
is the set of all minimal elements, and there is no smallest element in 
B, because there is no minimal element related to all other elements. 
B does not have a maximal element either. 


(277) Consider the partial order (P(Q), C ). It is obvious that for 
an arbitrary indexed family A = {A; CQ:i€ 1} C PC), it is true 
that sup A = [J A; € P(Q) and inf A = f) A; € P(Q). In particular, 

iel tel 
it holds true for pairs of subsets. Thus, (P(Q), C ) is a lattice. O 

4.9 Definition. A linear order (A, ~ ) is said to be well-ordered 
if every nonempty subset of A has a smallest element in the sense of 
the same order < . 0 


4.10 Example. Let R be the set of all real numbers and consider 
the relation (IR, < ) which is clearly a linear order. However, R is 
not well-ordered by < , for there are nonempty subsets containing 
no smallest element, such as (0,1). But (N, < ) is well-ordered. O 
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Can all sets be well-ordered? This is one of the fundamental 
questions in set theory posed by Georg Cantor in the 1870's. Cantor 
considered it obvious that every set can indeed be well-ordered. At 
that time set theory was not well-established yet. In 1908, Ernst 
Zermelo formulated his axiom of choice and showed in his paper, 
“Untersuchungen tiber die Grundlagen der Mengenlehre” (see [Zer]) 
that the axiom of choice is equivalent to the “well-ordering prin- 
ciple.” The axiom of choice was included in an axiom scheme for set 
theory that was later (1922) strengthened by Adolf Abraham Frankel 
in his paper, “Zu den Grundlagen der Cantor-Zermeloschen Mengen- 
lehre” (see [Fran]). 


Zermelo and Frankel introduced the following notions. Let S be 
a collection of sets. A function c defined on S is called a choice 
function, if for each X € S, c(X) € X. In other words, c assigns to 
each set exactly one element of the set. Or less formally, we can 
choose exactly one element from each set. Observe that if S is an 
indexed set, that is, S = {X;:i € I}, then we have f(i) = c(X;) € 
X;. The axiom of choice is formulated in this way: 


Every system of sets has a choice function. 


Zermelo proved that a nonempty set A can be well-ordered if and 
only if its power set P(A) has a choice function. (There will be a 
short discussion of the axiom of choice in the upcoming sections. ) 


4.11 Theorem (Zermelo). The axiom of choice is equivalent to 
the well-ordering principle. O 


4.12 Examples. 


(7) To illustrate a use of the axiom of choice, consider the 
following example. Let [X,Y, f] be an onto map. We show that 
there exists a subset A C X such that Res, f: A — Y is bijective. 
Let c be a choice function for the factor set {f*({y}): y € Y} of X 
modulo E;. Then the set A = {c(f*({y})): y © Y} has the desired 
property. In other words, we choose one x from f*({y}) for each y 
and the collection of all these z's is A. 
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(it) Let A= {c(tan*{y}) = arctany:y € R}. Then, set A is 
A= (— 4,4) and hence [A, R, Res tan] is a function such that it 
is injective and (Res tan)! = arctan. oO 


One of the central results in set theory is Zorn's lemma [1935] 
belonging to a German-born American mathematician, Max August 
Zorn (1906 — 1993), which is widely used in set theory and which is 
also equivalent to the axiom of choice. 


4.13 Lemma (Zorn). [f each chain in a partially ordered set A 
has an upper bound, then A has a maximal element. 


Another very useful result is Tukey's lemma (named so after the 
American mathematician and statistician John Tukey), which follows 
from Zorn's lemma. Tukey's lemma is sometimes referred to as the 
Teichmiiller-Tukey lemma. 


4.14 Lemma (Tukey). Let 2 be a nonempty set and F C P(Q) 
be a family of subsets of Q such that A € F if and only if each finite 
subset of A is in F. Then, F has a maximal element A in the sense 
of C, that is, there isno B € F such that Ax CB. 


PROBLEMS 


4.1 Show that the relation in Example 4.2 (vi) is an equivalence 
relation on Z. Give the equivalence classes for p = 4. 


4.2 Classify the following binary relations. 
a) Let 2. be a nonempty set. Define the relation (P(Q), C ). 
b) Let Q = R?\(z, 0). Define R: (a, b)R(c,d) = ad = be. 


4.3 The following theorem is a statement of the principle of math- 
ematical induction. 


Let S(n) be a statement which is true or false, for n= 
1,2,... . Let S(1) be true and suppose whenever S(n) is true 
implies that S(n +1) is true, n=1,2,... . Then S(n) is true 
for all n. 
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4.4 


4.5 


4.6 


4.7 


4.8 


4.9 


4.10 


4.11 


4.12 
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Prove it. (Hint: Use the well-ordering principle.) 
21 
Prove that }7, 7° = gn(n+1)(2n +1). 


Show that (N, | ) in Example 4.2(v) (defined as: n | m if and 
only if n divides ™) is a partial order relation. 


Let 1° denote the space of all summable real-valued sequences. 
In other words, a sequence [N, R, f], in the form of {a,}, is 
summable, if the series ear converges. We define the 
partial order = onl’ as f < g if and only if f(n) < g(n) for 
alln EN. Is (1°, = ) a lattice? 


Let Monn) be the set of all non singular n x n matrices. We 
define the partial order =< on Monn) sA=(,)3 B= 
(b;;) if and only if and only if aj; < b;; for all i and 7. Is 
(Mi, oe ) a lattice? 

Is the set of all continuous real-valued functions a lattice? 


Is the set of all real-valued polynomials a lattice? 


Prove Theorem 4.4: Let [X,Y , f] be a consistent function with 
E; being its equivalence kernel. Then there is a unique 
function [X|Ez, Y, f| such that f = f 0 7p,. 


Prove part (a) of Corollary 4.5: Let [X,Y, f] be a function 
and let Er denote its equivalence kernel. Then there is a 
unique reducer function [X|Es,Y,f] of f such that f is 
injective and f can be represented as a composition 
f = fom, of [X, X|Es, 7,] and [X|E;,Y, f]. 


Prove part (b) of Corollary 4.5: Let [X,Y, f] be a function 
and let Ey denote its equivalence kernel. Then if f is 
surjective, the reducer f is bijective. 
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4.13 


4.14 


4.15 


4.16 


For the function [R, | — 1, 1], sin] render a similar analysis as in 
Example 4.6: give the equivalence kernel, the quotient set, the 
projection of R on its quotient set, and the reducer. 


For the function [[—7,7],|[—1,1],sin] render a similar 
analysis as in Example 4.6: give the equivalence kernel, the 
quotient set, the projection of [| — 7, 7] on its quotient set, and 
the reducer. Is the reducer bijective? 


For the function [IR, [ — 1,00), f(x) = x? — 1 render a similar 
analysis as in Example 4.6: give the equivalence kernel, the 
quotient set, the projection of R on its quotient set, and the 
reducer. 


Let [es Cy, Dt denote the differential operator [D! f(z) is 
defined as 4 f(z) ] on the set of all real-valued continuously 
differentiable functions. Give the equivalence kernel of D', the 


quotient set, the projection of ‘ds ) on its quotient set, and the 
reducer. Is the reducer bijective? 
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NEW TERMS: 


binary relation on a set, 29 
- reflexive, 29 
- symmetric, 29 
- antisymmetric, 29 
- transitive, 29 
- equivalence, 29 
- partial order, 29 
- linear (total) order, 29 
comparable elements, 29 
[]z, equivalence class modulo E, 30 
XE, quotient (factor) set, 30 
factor (quotient) set, 30 
Tg, projection of a set on its quotient, 30 
congruence, 31 
congruence modulo p, 31 
modulus of congruence, 31 
equivalent classes generated by a function, 31 
equivalence kernel of a function, 31 
[X|E;, ¥, J], reducer, 33 
chain, 34 
minimal element, 34 
smallest element, 34 
maximal element, 34 
largest element, 34 
upper bound, 34 
lower bound, 34 
bounded set, 34 
least upper bound (supremum), 34 
supremum, 34 
greatest lower bound (infimum), 34 
infimum, 34 
lattice, 35 
join operation, 35 
meet operation, 35 
well-ordered set, 35 
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Cantor, Georg,, 36 
Zermelo, Ernst, 36 

axiom of choice, 36 
well-ordering principle, 36 
Frankel, Adolf Abraham, 36 
choice function, 36 
Zermelo's theorem, 36 
Zor, Max, 37 

Zorn's lemma, 37 

Tukey, John, 37 

Tukey's lemma, 37 
principle of mathematical induction, 37 
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5. CARTESIAN PRODUCT 


The idea of the Cartesian product (or, equivalently, direct product) 
primarily belongs to René Descartes who introduced this notion for 
two sets X and Y as a set of all ordered pairs {(x, y): 7 € X and 
y € Y}. Descartes was also the one who introduced the widely used 
Cartesian coordinate system related to the Cartesian product. 

In Definition 2.1, we introduced the notion of the Cartesian 
product of finitely many sets. We extend this definition to arbitrarily 
many sets. We begin with sequences of sets. 


5.1 Definitions. 


(i) Let {Y;: 7=1,2,...} be a sequence of arbitrary sets. Then 
the Cartesian product of this sequence is the set of all sequences 


wo 
LY =A igtas aj € Y;, i= I 2, see 
n=1 


of elements from Y;, Yo,..., respectively. 


xX 


Figure 5.1 
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(iz) In the general case, let {Y,: « € X} be an indexed family 
of sets. 


Then the Cartesian product (see Figure 5.1 above) defined as 


ITY: ={f: X> U Ya: f(z) €¥2, 2 € X} 
LEX TEX 


is the collection of all functions defined on the index set X and 


valued in (J Y,. Each such function is a choice function for the 
rEX 

family {Y,: « € X}. For an x € X, the set Y, is called the xth fac- 

tor space. O 


5.2 Remarks. 


(i) One of the basic questions that arises is this: when is the 
Cartesian product nonempty? Obviously, if at least one factor space 
Y, = @, then [[ Y, = O. But if all Y,, 4 O, is the Cartesian product 

rex 
necessarily nonempty? Although the answer may seem obvious, we 
must turn to the axiom of choice. In other words, the Cartesian 
product of a family of sets is nonempty if and only if there exists at 
least one choice function for this family, which is guaranteed by the 
axiom of choice. 


(it) We said that the Cartesian product of the family of sets {Y;. 


: « € X} is the collection of all functions from X to J Y,. In parti- 
rex 

cular, if Y, = Y, for all x € X, then the Cartesian product is the 

collection of all functions naturally denoted by Y* [or by F(X; Y)]. 


5.3 Example. As an illustration of a use of the Cartesian product 
we show that given a nonempty set X, the power set P(X) is “equi- 
potent” with the set {0, i. So, let X be an arbitrary set. Then every 
subset A C_X can be associated with its indicator function 14. Con- 
versely, 


A= {x € X: 14(zx) = 1}. 
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Therefore, we can set a one-to-one correspondence between P(X) 
and the set of all indicator functions indexed with all subsets of X. 
On the other hand, the set of all such indicator functions is in fact the 
set of all (binary) functions of type G: X — {0,1}. [Indeed, if 6 is a 
binary function, {2 € X: G(x) = 1} = B is a subset of X. Thus, 
(G =18.] This set, by the above definition, is the Cartesian product 
of the family of sets Y, = {0,1}, where the index x runs X, in 
notation, {0,1}*. So, we have shown that P(X) is “equipotent” 
(i.e., in a bijective correspondence) with the set {0,1}* of all binary 
functions G: X — {0,1}. O 
5.4 Definitions (Projection Maps). 


(i) Let {Y,: x € X} bea collection of sets. Given an a € X, 
the map 


[TL ¥.,¥oo| 


rEX 


is called the ath projection map if 


Tal f) = f(a), 


where 


f ©] Yz and f(a) € Ya. 
LEX 


The point f(a) is called the ath coordinate of f and we recall that 
Y, is the ath factor space. (See Figure 5.2 below.) Observe that 


m ({f(a)}) A {f} but it contains { f}. 
For instance, if X = {1,..., n}, 


m({F()}) = {fC} x Yo x + x Yn. 


In general, 


({fia)}) =T] Ye, (5.4) 


crEx 
where 


Y,=Y,, forr#a, and Yo = {f(a)}. (5.4a) 
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Figure 5.2 


In a more familiar scenario of the 2-D Cartesian coordinate sys- 
tem the two figures below readily speak for themselves. 


Y, 


2 
A 


TL(f)=f@) # ~ = = « of 


| 
| 
| 
| 
| 
| 


e > 
Th (N=F0) 


as 


Figure 5.3 
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Y,4 TF} 


2 


fQ) y, 
Figure 5.4 


(it) Obviously, from (5.4-5.4a) we have 7*(Y,) =[[ Y:. In 
tEX 
particular, in the latter two-dimensional case, 7}(Yi) = Y1 x Yo. 


Now, if By C Yq, then 


where 
A AN 
Yee), 2 ee, od Ye =— By 


The subset 7*(B,) of [] Y; is called a unit cylinder with base 
LEX 
B,. (See Figure 5.5 below.) 
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Figure 5.5 


Figure 5.6 
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Notice that we should take pictorial illustrations like those in 
Figures 5.5 and 5.6 with extreme caution. For instance, if we take 
two disjoint sets A,, B, C Y,, then because inverse images of dis- 
joint sets are disjoint, so must be 7*(A,) and 7*(B,). However, 
their intersection will not look empty, for they will appear inter- 
secting on the entire plane except on Y,,. This is because all Y,,'s are 
nonoverlapping (which we need to remember whenever we come to 
draw a figure) and as depicted, they seem to form a solid area. We 
thus admit that these figures are useful, but imperfect, illustrations. 


In the conventional, two-dimensional Cartesian system, a unit 
cylinder 7}(B,) with base B, looks like shown in Figure 5.7. 


Ya 


2 


7; (B,) 


Figure 5.7 


(iit) Now, for a, 8 € X, let By C Yq and Bg C Yg. Then, ob- 
viously, 
A 
m,(Ba) 1 3(Bs) = 1] Pe, 


. rEX 
A A A 
whee, = Y,,.07 0, >,.and Y,—=By,¥ a= Bp: 


The set 7/(Ba) 9 73(Bg) is called a simple cylinder with base 
By x Bg. (See Figure 5.6.) This construction of a simple cylinder 
can be extended to any finitely many sets By,,..., Ba,,- 
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(iv) Let B, C Y,, x € X. The set 


I] Bs = 1) 7(Br) 


rex rEx 


is called a rectangle or parallelepiped. 


If A C X and 
Pn Bee Ys, zrEA 
a Mes ze xX\A, 
then the set 
AN 
[yes 8) (5.4b) 
TEX rEA 


is a (rectangular) cylinder with base [| By. A cylinder in (5.4b) is 
reA 
simple if set A is finite. 


Y, 
* 


Figure 5.8 
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In a conventional Cartesian representation of the Euclidean three- 
dimensional space, common in calculus-based geometry, with 
X = {1,2,3}, the factor spaces Y|, Y2, Y3 are drawn as mutually 
orthogonal axes. If we take A = {1,2} and B, and By as intervals 
from Y; and Y9, the cylinder with base B, x By will look like an 
infinite parallelepiped (with B; = Y3). In Figure 5.8 above, the 
perpendicular strips are just “traces” of 7}(B,) and 75(B2) on plane 
Y, x Yo. 


(v) Now we introduce a more general notion of a projection map. 
Let {Y,: 2 € X} be an arbitrary indexed family of sets and let 
AC X. Define 


I] x ; Il Vo > 7A 
rEX acA 


and call it the A-projection map if 74(f) = f.(A). Specifically, if 
A= {a} we have m,.4(f) = f«({a}) which, in contrast with 
definition (2), is a set, namely, a singleton. 


In light of Definition (iv), the A-projection map 7,4 can be used 
in representation (5.4b) of a rectangular cylinder with base || B, as 
xrEA 
n( la B.), where B,, is a subset Y,, for x € A. Recall that a rect- 
ZEA 
angular cylinder is simple if A is a finite subset of X. In particular, 
the rectangular simple cylinder depicted in Figure 5.7 (with base B, 
x Bp) can be expressed as Toy (Bi Bo). TEA = 1eigcs.gitn hs 


rectangular simple cylinder can also be expressed as 
2 oe Th f= 1 n 


(vi) With this more flexible notion of the A-projection map we 
can further generalize the notion of a cylinder. Let A C |] Y,. Then 
acA 
call 7%(A) an A-cylinder with base A. An A-cylinder is then rect- 
angular if A is a rectangle with respect to [[ Y., that is, A = [[ Ba 
acA acA 


for By = Fas 
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In particular, if A is a finite subset of X then the rectangular 
cylinder becomes simple as in Figure 5.6 above. A simple A-cylinder 
is a unit cylinder if A is a singleton. (See Figure 5.5.) O 


5.5 Examples. 
(4) Let A= {aj, a2,...,Qn}. Then, 


Wad ccm tt) = FAs sone Qn }) = {f(a1), eee) f(an)$; 


and hence, 


Tags san} F(a ++ FO) = Axa, (Fe) 


is an {a1,..., @,}-simple cylinder with base { f(a1),..., f(Qn)}- 


(ii) This example will be revisited in Section 5, Chapter 3. 
Consider as R® the collection of all real-valued functions on R regar- 
ded as the Cartesian product of R's. We select a “neighborhood” U; 
of a point f ¢ R®. First of all, a simple cylinder with base 
(yi — €1,Y1 t€1) X +++ X (YR — Ek, YR + Ex) Obviously has the form 


Moy (Oi = Big th A ea)] Ti+ Py, (Ge = eave + ee) y CA) 


where y,, is a point in Y, = R. This cylinder (in Figure 5.9 shaded 
with gray color including the red strips) becomes a neighborhood Us 
of f in the product space R®, if we replace y, with the correspond- 
ing traces f(a,) of f in the respective factor spaces Yq,,..., Ya,: 


Uy = m8, [(f (a1) — €1, f(a) tev] 


N mh, [Cf (ax) — €x, (ax) + €x)] - (5.5a) 
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Figure 5.9 


So, Uy is the collection of all choice functions from R® such that 
their traces on k fixed factor spaces Y,,,..., Yq, must lie within in- 
tervals (f(a1) — €1, f(ai) + €1),.--,(f (ai) —€1, f(a) +61), re- 
spectively, and with no further restrictions elsewhere. 


PROBLEMS 

5.1 For subsets A C Y; and B,C’ C Y9, prove that 
(2) 14x3(2,y) a 14(x)1 p(y), Ve € Ay eb. 
Gi) (Ax BY =(A4* x 4) UM x B®). 


Show the validity of the distributive law with respect to cross 
product and union, intersection, and difference: 


(iii) Ax (BUC)=(Ax B)U(AxC). 
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(iv) Ax (BNC)=(Ax B)N(AxC). 
(v) Ax (B\C)=(Ax B)\\(AxC). 


5.2 Prove that if A;, Ao € Y; and By, Bo € Yo, then 
(i) (Ay a) Ag) x (By M Bo) = (Ay x B,) a) (Ag x Bo). 


(it) (A, x By)\(A2g x Bo) = [(A1\A2) x By] U 
[A; x (B,\Bs)}. 


5.3 Is it true that 


(Ay a) Ag) x (B, M By) = (Ay x Bp) a) (Ag x B,)? 


5.4 On the other hand, using the distributive law in Problem 5.1 
(iti) it appears that 


(Ay M Ag) x (B, M By) 
= (Ay x B,) NM (Ag x By) 
MN (A, x By) NM (Ag x Bi). 


Is this correct? 


5.5 Find a short expression for 
(Ay a A» a A3) x (B, a) Bo). 
5.6 Prove that 


ie. ier 


(ii) Ul x B=U(Ai x B). 


el el 
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5.7 


5.8 


Fe 


5.10 
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ie. ie. 


Let 7 = [| ¥,; let Ay’ CY,, and let A=] Az, where A, 
rEX rEX 
= Y, except for finitely many values of the index x, say 


1,...,n © X. Show that A = () 1*(Ayj). 
k=1 


[o.¢) 
Let 2 = |] ¥,, and let Ay Cy, where for cach 2 = 1, 2, 
r=1 
.., the sequence of sets {A,,} is monotone nondecreasing 
(i.e. Ay, C Ao, C ---) with 


sup( Age? n= 12,25) = Age =e; 


n=1 


for x =2,3,.... Also assume that A ,, = Ao; = A3; 
... = Ay). Show that 


sup{ I Ang 1% = 1,2, et = 17(A}). 


Let A = A; x Ay and B= B, x Bo be subsets of Y; x Yo 
such that A;,B; C Y;,i = 1,2. Prove the validity of the de- 
composition 


A\B = [(A1\B1) x Aa] + [(A1M Bi) x (A2\Bz)). 


In other words, A\B can be partitioned as two disjoint “rect- 
angles: (A;\By) x Ag and (A, M By) x (A2\Be). 


Show the validity of yet another decomposition of A\B : 
A\B = [Ai x (A2\Bz)] + [(Ai\Bi) x (A2 Ba)). 


Let {A, C Y,; © € X} and {Z, C Y,;y € X} be two family 
of subsets. Show that 
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cEX yEex rEX 


a) (114.) 0 (1.2,) = T] (A,n B,). 


b) (14.)u (2-2 Tana 


vExX yEex rex 


5.11 Let Aj,..., Am C Yi and By,...,B, C Yo. Show that 


Aa] x [Ae] = A Aca « Bo. 
j=1 j=1 j=1 521 


5.12 Let {A;:i € I} and {B;: 7 € J} be two indexed families of 
subsets of Y| and Y2, respectively. Show that 


() B; 


jet 


= NA x Bi). 


tel jeJ 


nay] x 


ie 


5.13 Let oy : iz, € Ty} be an indexed family of subsets of 
¥,, 6 = 1p. ye Show that 


Ht] 9,50" 


k=1 | ipEl; 


=... (Tis\?). 


yeh t,€l, \k=1 
5.14 Letm,n Ee Nand Y 4@. 


a) For m<n, using a relevant projection map give a 
bijective function [P(Y™), P(Y”), - ]. 


b) Using a relevant projection map give a bijective function 
PO), PO"), «1. 
c) Give a relevant projection map [Y" x Y®,Y®, - ]. 


d) Give a relevant projection map [Y"® x Y®,Y®, -]. 
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e) For AC X, using a projection map give a bijective 
function [P(Y“4), P(Y*), -]. 


315 Ley, = 


R, forall x € 


R, A = (0,2). 


a) Draw 7 4(f) for f(x) = 2. 
b) Draw 774,(A) for A = (0,1) x (0,1). 


5. Cartesian Product 


NEW TERMS: 


Descartes, René , 42 

Cartesian product of a sequence, 42 

Cartesian product of an indexed family of sets, 43 
factor space, 43 

projection map, 44 


| Ld Yeu Yas ro| , ath projection map, 44 

LEX 

ath coordinate of a function, 44 

ath factor space, 44 

unit cylinder with a base, 46 

simple cylinder with a base, 48 

rectangle, 49 

parallelepiped, 49 

rectangular cylinder with a base, 49 
I] Y:, [[ Ya, 74], A-projection map, 50 
cEX acA 

™4(A), A-cylinder with base A, 50 

neighborhood of a function, 51 
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6. CARDINALITY 


One of the main predicaments in the theory of sets is finding a 
criterion for their “contents.” For finite sets we can count the number 
of elements they contain and assign them to their respective equi- 
valence classes. We can easily define an equivalence relation in such 
a class, for example, introducing C,, as the class of all n-element sets 
for every n € No. A partial order relation in this class would act as 
an appropriate comparison among sets from various classes. Sets A 
and B can be compared, in notation A < B, if and only if A € C,,, 
BeéC,, and n < s. Then we could assign to set A the number n 
and call it the cardinal number of A, in notation card(A) or |A| 
(which we use interchangeably). Doing this, however, we are up 
against defining cardinal numbers for infinite sets. A formalism of 
cardinality belongs to Georg Cantor who was the first to introduce a 
well-structured concept of “infinity” in his pioneering work done in 
the 1870s and 1880s. 


We present a rather informal version of cardinality sufficient for 
our goals throughout this book. A curious reader should be referred 
to special texts on set theory (cf. [Hrba] or [Jech] in the Bibliography 
Section). We start with comparison of finite sets that will enable us 
to extrapolate such principles to infinite sets. 


6.1 Definitions. 


(i) Two sets A and B are said to be equipotent if there is a 
(consistent) bijective function [A, B, f]. In this case we denote 
card(A) = |A| = card(B) = |B] (or A= B) and also say that A 
and B have equal cardinality. 


(iz) If there exists an injective function [A, B, f], then we say 
that the cardinality of A is less than or equal to the cardinality of B, 
in notation |A| < |B| or A x B. If |A] < |B] and |A| £|B| we 
write |A| < |B| or A < B. 


(iit) A cardinal number is an equivalence class containing all 
sets that are “ & -comparable.” 
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For most cardinal numbers we use the same notation as for 
regular numbers. For example, rather than using C,, for the class all 
n-element sets we just write n. 


(iv) Let 0 denote the cardinal number of the empty set @ (the 
only representative of this class). Note that 0 is not a number but the 
class containing ©. Thus, card(@) = 0. 

(v) Similarly, the cardinal number n is the equivalence class con- 
taining (among other n-element sets) the set {1,..., n}. Therefore, a 
set A is finite if it is equipotent with a set of some cardinal number, 
say n, such that the integer number n is an element of N, that is, 
|A| = |{1, ...,n}| =n. A set that is not finite is called infinite. 


(One can easily show that N is infinite.) 


(vi) A set A is said to be countable or denumerable if it is equi- 
potent with N and in this case we write card(A) = No (pronounced 
aleph naught or aleph nought). A set A is called at most countable if 
|A| < |N| or A XN. 


(vii) An infinite set that is not countable is called uncountable. 


(vii) A set A is said to have the cardinality of continuum if it is 
equipotent with the set R of real numbers and we write card(A) = c. 
(We show below that No < c.) O 


6.2 Remark. For every set 22, the property |A| = |B| induces an 
equivalence relation on the power set P(Q), and |A| < |B] induces a 
partial order on P(Q2) (see Problem 6.1). 


6.3 Examples. 


(i) If sets A and B have only finitely many elements, then 
A & B if and only if they have the same number of elements. 


(iz) In contrast with finite sets, an infinite set can be equipotent 
with a proper subset of itself. Consider A = {1,3,5,...} CN and 
define f(n) =2n—1, neéN. Then [N,A,/f] is bijective and 
Nw A. 


(ii) No + No x No. Indeed, the function 


f(k,n) = 2*(2n+1)-1 


60 CHAPTER 1. SET-THEORETIC AND ALGEBRAIC PRELIMINARIES 


is bijective from No x No to No. Similarly, N +N x N. 

(iv) Let {Aj, Ao,...} be a countable family of countable sets. 
Then its union A = U A,, is countable. To construct an appropriate 
bijective map we cy roneceal A as a countable union of disjoint 
sets. Let 


n—-1 
By = Aj, Bo = Ao\Aj,.--,;Bn = An\ LU Ax (for n = 1) donee ss 
k=1 


Then, clearly A = }>* | B,,. Without loss of generality, we assume 
that each set B,, is countable (some of them can be finite, but it is 
fair to say that they are at most countable) and, therefore, can be 
enumerated as: By = {bn1, Ong;<<« ty 17 = 1 2c. « We can’ place 
these sets in the matrix form: 


B= {bi1, bro, b13, bia, --- 
By = {ba1, bo, bag, bag, ... 
B3 = {b31, b39, b33, b34, eae 
Ba = {bay, baa, bag, bag, --- 


Now the desired bijective map [N,A, f] is formed as follows: 
f() = bi, f(2) — bio, f(3) — boi, f (4) _ b31, f(5) = boo, f (6) 


= bj3, cee 


(v) The set Q of rational numbers is countable, for the function 
f(™) = (m,n) is one-to-one from Q to (N x N)U {(0,0)}. The 
latter is countable by (777). 


(vi) We can show that No < c. Clearly, No < c. It is sufficient to 
show that N ~< [0,1], for [0, 1] + R (see Problem 6.5). If there is a 
bijective function [N, [0,1], f], then f(n) is of type 0.an,dn,... . 
Now define the number 0.b;b2... such that b; = 3 if a;,4#3 and 
b= 5. tt = 60 =— 1208 Then the number} ) = 0,005.0. 
cannot appear among the values of f(n), for it differs from f(n) at 
the nth place. On the other hand, b € [0,1] contradicts the assump- 
tion that f is surjective. Thus No < c. Observe that each rational 
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number has two representations, for example, 0.1 and 0.0999... . 
That means we have to be careful about the numbers above that 
appear “different.” O 


The following theorem is one of the central results in set theory. 
6.4 Theorem (Cantor). A < P(A) for every set A. 


Proof. The result holds trivially for any finite set A (see Problem 
1.4). Specifically, for the empty set, |O| = 0, whereas | P(@) | = 1. 
Because P(A) contains all singletons, it immediately follows that 
A x P(A). To show that | A| #4 | P(A) |, we assume that A = 
P(A) and get a contradiction. By our assumption, there exists a bi- 
jective map f: A — P(A). Then each element a in A is also an 
element of a subset of A that contains a. In other words, a may 
belong to f(a) (a subset of A) or may not. We then define B= 
{ae A: a€¢ f(a)}. B is nonempty, since there exists at least one 
element ap € A assigned to @. We pick a point b € A such that 
f(b) = B. By the definition of B, be B = b¢ f(b) =B, and 
this is a contradiction. O 


6.5 Remarks. 


(i) In Remark 5.2(izi) we showed that the power set P(X) of a 
set X is equipotent with the set {0,1}* of all functions f: X — 
{0,1}. Note that 2 is the cardinal number of the set {0, 1}. Thus, we 
conclude that |P(X)| = 2/*! (where we set |B|'4! = |B4|). In parti- 
cular, if |X| = |N| then |P(N)| = 2%. An interesting fact is that 2% 
=c, the proof of which is left for the reader as an exercise (see 
Problem 6.6). 


(it) The continuum hypothesis refines Cantor's Theorem 6.4, 
asserting that if a is an infinite cardinal, then there is no cardinal 6 
such that a < b < 2°. This was conjectured by Cantor for a = No. In 
1900 David Hilbert included the “continuum problem” as Problem 
#1 in his famous list of open problems in mathematics. In 1940 Kurt 
Gédel proved that the continuum hypothesis is consistent with (i.e., 
does not contradict) the axioms of set theory (axiom of existence, 
axiom of choice, etc.). In 1963 Paul Cohen [Cohe] showed that the 
continuum hypothesis is independent of the axioms. 
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(iit) The cardinal number 2° is called the hypercontinuum. For 
example, the set P(R) has the hypercontinuum cardinal. O 


Supplementary Historical Note. Modern set theory was 
founded by Georg Cantor, in a sequence of several articles that 
appeared between 1870 and 1880. One of these articles, “Uber eine 
Eigenschaft des Inbegriffes allen reellen algebraischen Zahlen,” 
appeared in Crelle's Journal in 1874, and is said to have given birth 
to set theory. Georg Cantor was born of Danish parents in St. Peters- 
burg, Russia, in 1845, and lived there until 1856, when his parents 
moved to Frankfurt, Germany. 

Cantor began his university studies at Ztirich in 1862. After one 
semester at Ztirich he moved to Berlin University, where he attended 
lectures of Weierstrass, Kummer, and Kronecker. Leopold 
Kronecker later became Cantor's main opponent, criticizing his 
concept of infinity and regarding it as theology and not as mathe- 
matics. (As an active theologian, Cantor became a target of attacks 
by his liberal opponents in Berlin University.) In 1867 Cantor 
received his Ph.D. (in number theory) from Berlin University. His 
dream to get a teaching position at Berlin University never came 
true, primarily due to the opposition of Kronecker. 

In 1869 Cantor was appointed to Halle University, where he 
remained until his retirement in 1913. Cantor died in a mental hospi- 
tal in Halle in 1918. In 1925 David Hilbert recognized Cantor's 
concept of infinity. He said, “No one can drive us from the paradise 
that Cantor created for us.” O 


PROBLEMS. 


6.1 Show the validity of the second statement in Remark 6.2: For 
every set Q, |A| < |B| induces a partial order on P(Q). 


6.2 Prove the Schréder - Bernstein theorem: /f A =< B and 
BHA, thn A = B. 
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6.3 


6.4 


6.5 


6.6 


6.7 


6.8 


6.9 


6.10 


6.11 


6.12 


6.13 


6.14 


(a) Let A be an uncountable set and let B C A be countable. 
Show that A\B is uncountable. (b) If |A] = ¢ and |B| < No, 
show that |A\B] = c. 


Prove that every subset of a countable set is at most countable. 
(Hint: Use the well-ordering principle.) 


Show that R ~ (0, 1]. 


Show that 2% = ¢. 


Let [X,Y, f] be a surjective map. Show that there is a subset 
of X equipotent with Y. 


Let [X,Y, f] be an surjective map, where Y is countable, and 
let f*({y}) be a countable set for each y © Y. Must X be 
countable? 


We call an algebraic number any root of a polynomial with 
integer coefficients. What is the cardinal number of all 
algebraic numbers? 


Prove the statement. Every infinite set contains a countable 
subset. 


What is the cardinal number of all polynomials whose coeffici- 
ents are algebraic numbers? 


Show that the set of all finite subsets of N is countable. 


Show that the set of all finite subsets of a countable set is 
countable. 


Let 2 = {B, : « € X} be a countable family (i.e., X ~ N) of 
countable sets. Let 


B=1{B,, x++* By rn € Ny A= {x1,..-;2,} CX} 
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denote the set of all finite Cartesian products from 2. Show 
that B is countable. 


6.15 Show that if f is a monotone (nondecreasing or nonincreasing) 
real-valued function on interval [a, b], then the number of dis- 
continuities of f is at most countable. 
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NEW TERMS: 
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cardinal number, 58 

Cantor, Georg, 58 

equipotent sets, 58 

cardinality, 58 
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7. BASIC ALGEBRAIC STRUCTURES 


Algebra is a mathematical discipline that studies algebraic 
structures. The most rudimentary algebraic operations with natural 
and positive rational numbers were already encountered in ancient 
mathematical texts. The famous book, Arithmetics, by the Greek 
Diophantos (of Alexandria) in the third century A.D., has a signifi- 
cant influence on the development of algebraic formalism. The term 
“algebra” stems from the text Al-jabr wa'l-mukhabala, by 
Muhammad Al-Khwarismi (an Arab mathematician of Persian ori- 
gin) in the ninth century A.D., which dealt with solution techniques 
for various problems that reduced to first- and second-order algebraic 
equations. Not until the end of the fifteenth century, when the 
common algebraic operations +, —, %X, power, roots, and 
parentheses were introduced, did one use cumbersome phrases and 
descriptions of algebraic expressions. Francois Viéte, by the end of 
the sixteenth century, was the first to use letters to denote unknowns 
and parameters. 

The algebraic symbolism, as we know it now, has been used only 
since the middle of the seventeenth century. Elementary algebra 
(which deals with arithmetic operations on real numbers, first- to 
fourth-order algebraic equations, binomial formula, Diophantine 
equations) was completed by the middle of the eighteenth century. 
Leonard Euler's Jntroduction to Algebra was one of the most 
prominent texts then. In the early nineteenth century, the algebra 
became furnished with five basic (commutative and associative and 
distributive) laws with respect to two algebraic operations, + 
(addition) and - (multiplication). On the strength of Dirichlet's 
definition of a function, later on, these operations were declared to be 
binary operations based on the following definition. An operation on 
a set A is a rule that assigns to each ordered subset A,, C A of n 
elements a uniquely defined element of the same set A. For n = 
1,2, and 3, the operation is called unary, binary, and ternary, 
respectively. 

The algebraic structures were formalized in 1830 by the British 
mathematicians George Peacock in 1830, Duncan Gregory in 1840, 
and Augustus De Morgan and further refined by the Germans 


7. Basic Algebraic Structures 67 


Hermann Hankel and Hermann Grassman. Abstract algebra is regar- 
ded as having been born in 1846, when Joseph Liouville published 
Galois' theory (of solvability of polynomial equations) based on the 
group concept, which began to spread within mathematics. In 1872, 
the German Felix Klein published a program in which he proposed 
to formulate all of geometry as the study of invariants under groups 
of transformations. In 1883, Norwegian mathematician Marius 
Sophus Lie published his fundamental work on continuous groups of 
transformations used in studies of continuous functions. Group 
theory, which is at the heart of contemporary abstract algebra, made 
prominent contributions to geometry, topology, and even physics in 
the twentieth century. 

In this section, we review some familiar algebraic structures. 
These provide a basis for analysis, shifting to more abstract settings 
in the upcoming chapters. 


7.1 Definitions. 


(i) A set G with a binary algebraic operation * (frequently 
referred to as addition + or multiplication - ) from G x G into G is 
called a semigroup, in notation (G, «), if * is associative. (Note that 
even though + or - may denote addition and multiplication, they 
need not mean the conventional algebraic operations known for num- 
bers.) 


(22) A semigroup (G, «) is called a monoid if there is an element 
1 € G (called a two-sided identity) such that for alla € G, x1 = 1 
n= me. 

(277) A monoid (G, *) is called a group, if for each x € G, there 


is a *-inverse x’ such that x * a’ = a’ *x2 = 1. 


If « is commutative (semigroup, monoid, or group), (G,*) is 
called commutative or Abelian. 


If we use for * symbol + or —, (G, +) is referred to as 
additive or (G, - ) multiplicative, respectively. It means that in the 
context of a certain binary operation, * can be identified as + or - 
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If (G, +) is additive, the identity element 1, denoted by @, is 
called zero, and the element x’ denoted by — x is said to be an 
additive inverse of x. 


If (G, - ) is multiplicative, the element 1 is called the unity. The 
element x’ is denoted by x! and is said to be a multiplicative 
inverse of x. 


(iv) A set R with addition + and multiplication - from R x R 
into R |ie., atriple (R, +, - )], is called a ring if: 


a) (R, +) is an Abelian group, 


b) - is associative, 
) 


CV 0,02 € 7, 
x:-(a+b)=a2-a+2-b (called the left distributive 
law) 
(a+b)-xc=a-x2+b-a (called the right distributive 
law). 


Observe that multiplication - need not be commutative in a ring. 
However, if this is the case, the ring is called commutative or 
Abelian. A ring need not have a unity (multiplicative identity) either; 
consequently, a ring equipped with a unity is called a ring with unity. 
[For instance, the set of all n x n matrices M(,,,,) is a noncommu- 
tative ring with unity (unit matrix).] 


(v) Let G, *) and (G, ¥) be two groups and let [G,G, f] (or 
more precisely, [(G,*),(G, ¥),f]) be a map preserving the 
algebraic operations *« and *, that is, such that 


fy) = fe) * FY). 


Then f is called a (group) homomorphism from G into G. 


If [G,G, f] is bijective then it is called an isomorphism. In this 
case, the groups (G, *) and (G, ®) are called isomorphic. If |G,G, f] is 
a homomorphism, and in addition, (G, *) = (G,¥ ), then [G,G, f] is 
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said to be an endomorphism. If [G, G, f| is an endomorphism and an 
isomorphism then it is called an automorphism. O 


A homomorphism preserves some (but not all) structural 
properties of groups, as the following theorem states. 


7.2 Theorem. Let [G,G, f] be a homomorphism. Then 
(i) foreachzx eG, f(ax') =[f(x)]’, and 

(ii) fO)=1. 

(See Problem 7.4.) 

7.3 Definition. Let |G, Gg: f| be a homomorphism. Define 


Kerf = f*({1}) 


and call it the kernel of f. O 
7.4 Examples. 


(2) The space of all continuous functions with operation + 
forms an Abelian group. The same space is not a group with 
operation of multiplication. 


(zz) All polynomials with operation + form an Abelian group. 


(ii7) (Z, +), (UR, +), and ((0,00), -) are Abelian groups, 
while (Z, - ) is an Abelian monoid. 


(iv) The space C\{(0,0)} with the operation complex multipli- 
cation - is obviously an Abelian group and (C, + , - ) is aring. 


(a) Let Cr = (")([a, b];IR) denote the space of all n times 
continuously differentiable real-valued functions on [a, b] C R. Then 
Ge , +) is a commutative group. If D" f denotes the nth deriva- 
tive of a function f, then [Gar +) : (Goi +) ,D"| is a group 


homomorphism. Replacing ci a with the space # of all polynomi- 


als on [a, b], we have [H% AD") as an endomorphism. 
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(vi) Consider two groups (IR, +) and ((0,0co), -) and the 
function f(#) = e*. Then, [IR, R,, f] is an isomorphism. Indeed, 
f(a +y) = f(x)- f(y). In addition, [R, R,, f] is bijective. 


(vii) Let F = F(X; Y) = Y~* be the space of all functions from 
X to Y. Then, (F,-) is a multiplicative monoid. For any 
nonnegative integer n and f € F, define the unary operation power 
f" on F as f° =1, f"*! = f - f”. The power has the properties, 
f'- f* = f't* and (f')* = f*. Note that the power can be defined 
on an arbitrary multiplicative monoid with the above properties. 


wz 


(viit) The set Mio.) of all nonsingular 2 x 2 matrices is a multi- 


1 0 
0 |) Because 


for each AE Mi2.2)s there is a unique two-sided multiplicative 


plicative group with the two-sided unity 1 = I, = ( 


inverse At. 
Now, a function T from C onto C (where C = CU {oo}) is 


called a bilinear transformation if T(z) = sett if | : : | =O, Let 


T denote the set of all bilinear transformations. Then, (7, ©), 
where o stands for composition, is a multiplicative group with the 
two-sided unity 7 = 1 such thata =d=1, b=c=0. 


Indeed, firstly we notice that a bilinear transformation 7’ in the 


form T(z) = 272 can be associated with a nonsingular matrix 
— { G11 412 * 
A= & M22) 
a21 422 


through the function f defined as T = f(A). Furthermore, it can be 
readily shown that the composition ToS of TJ with 
S(u) = 'uutoe — £(B), where 


by, u-+bo9 


— f bu bie . 
es & 2) = ae): 
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can be associated with the matrix product AB = A- B. Conse- 
quently, 7" stands in association with A!. In other words, f(A7') 
at Sa 


In a nutshell, (Mis, . ) and (J,©) are two isomorphic 
groups, with isomorphism (Maa, Zz. f) such that 
J(AB) ST oS =f(A)o f(b). 
The above discussion agrees with Theorem 7.2 where 
(a) foreach Ac Mi, f(A) = f(A). 
(6) fe) =1. 


(ix) Let F = {[X, X, fi]; t > 0} be an indexed family of func- 
tions and let « be some binary operation defined on F. (F, *) is 
called a semigroup (of functions) if fy = 1 and for all s,t > 0, f, * 
fi = fs+t. Obviously, the semigroup (F, *) is an Abelian monoid. 


(2) Let 14 (CR) be the space of all sequences such that for 
cach X= (oy @iyiun) Ely & 9 |@n| <0o. Delime ‘the 
following operation on /'. For x and y, let z = (2, 21,...) =x*y is 
such that z,, = > pleunk (called discrete convolution). The 
operation * is commutative and associative and it is closed within /' 
(see Problem 7.12). Obviously, 1 = (1,0,0,...) is the two-sided 
unity of (I', *) and thus (/', +) is an Abelian monoid. 


(xi) Let F, = F(R; R) denote the space of all bounded real- 
valued functions. For a function A € F,, define 


ule 
Uw eAlu) _— ee LA) 


in agreement with the unary operation power of Example (v7). 
Obviously, for each u, the above series converges absolutely, 
because there is a positive constant WV such that 


co |A(u)|* co MF _ om 
Sat a Se 
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so that e“ is again an element of F,. For a fixed A, define the family 
of functions f; = e*. t > 0. From the above definition of e“ it 
follows that fy = 1. It is easy to show that e*4e’4 = e+, Indeed, 
9 3t Ai 9 tk Ak sith 
0" 5 — er se TRI Ar 
itk=r 
{0<i,k <n} 


= Uinmo t Lino Gaont = Logs +4)". 


s+t) 


The last expression yields e+4 when letting n — oo. Con- 
sequently, (e'4, -) is a special case of a semigroup introduced in 
(7a). This example can be generalized for operators, for instance, 
square matrices. To discuss such cases rigorously, one would require 
the concept of the norm of operators. oa 


7.5 Definitions. 


(i) Let F be a nonempty set with two binary operations, additi- 
on and multiplication (very often, for the elements of F, we drop the 
conventional multiplication symbol - ). (F, +, -) is called a field if 
it is a commutative ring with unity and if for every a # 0 there is a 


multiplicative inverse a“. 


In other words, F is a field if for all a, 6,7 € F the following 
properties hold. 


1) (commutative law) a+ B= G+a, af = Ba. 

2) (associative law) (a+8)+y=at+(6+y7), (ab)y= 
a( Gy). 

3) (zero) there is an element 0 € F such thata +0 =a 

4) (additive inverse) there is an element —a€F such that 
a+(—a) =0. 

5) (distributive law) a(G + y) = aG+ ay. 

6) (unity) there is an element 1 € F such that la = a. 

7) (multiplicative inverse) for every a#0, there is a 1 ¢F 
such that aa~! = 1. 


The elements of a field are called scalars. 
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(ii) Let F be as above with the exception that F does not have 
additive inverses. Then F is called a semifield. We will denote a 
semifield by F , 


[The set of all rational numbers, Q, the set of real numbers, R, 
and the set of all complex numbers, C, are the most common 
examples of fields. The set of all nonnegative rational or real num- 
bers and the set of complex numbers z € C with Re(z) > 0, are 
examples of semifields. | 


(iii) A linear or vector space X over a field F is a nonempty set 
with the binary operations addition (+) from X x X into X and 
multiplication (- ) from F x X into X such that 


1) + is commutative and associative, 

2) there exists an element (called an origin of X), 0 € X such 
thatO-2 = 0, Va € X, 

3) leeSHa, Ve eX, 

4A)a(za+y)=ar+ay, (a+ f~)t=ar+ Br, Va,PeF, V 
oy eX, 

5) a(Gx) = (aB)z, Va, BEF, Vx EX. 


A vector space X over the field F or semifield F, is often denoted 
by (X, Ps ae ) or (Ea, a es Te 


(iv) Elements of X are frequently called vectors. If F = R then 
X is called a real vector space. If F = C then X is called a complex 
vector space. If in (iii) a field F is replaced by a semifield F,, then 
we call X a semilinear or semivector space. 


(v) Any subset of a vector space, which itself is a vector space, 
is referred to as a vector (or linear) subspace (subspace for short) or 
a linear manifold. 


(vi) A ring (A,+, -) is called an algebra over a field F if its 
additive (Abelian) group (A, +) is a vector space over F. An algebra 
over a field F is denoted by (A;F). If (A;F) is an algebra, a pair 
(A’; F’) is called a subalgebra [of (A;F)], if A’ C A, F’ CF, and 
(A’; F’) is also an algebra. The above characteristics of commutative 
rings and rings with unities are hereditary for algebras. O 
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7.6 Properties of Vector Spaces. 


(i) By Definition 7.5(i77), 2) and 3), we have 0+2=0-2 
+1-2=(0+1)-x=-2. Therefore, the origin 6 is a zero and, 
according to Problem 7.1, it is unique. 


(it) For every x€X, there exists —z_ such that 
x +(—«) = 0. Indeed, by Definition 7.5(7i7), 2) and 4), we have 


6=0-r2=([1+(-D)-cx=2+(-1)-z. 


We call (— 1) the additive inverse of x and denote it by — 2. 
[The reader should not confuse the additive inverse — a of a scalar 
a previously introduced in Definition 7.5(7) with the additive inverse 
— x of a vector x first appearing in 7.6(72).] 

(iiz) Properties (2) and (71) imply that a vector space (X, + ), 
with the operation of addition, is an Abelian group. 


(iv) VaeEF, ab =a(0-2) =(a0)-r=0-2=8. O 
7.7 Definition (Axioms of a Partially Ordered Vector Space). 


(i) A vector space (X,F,+,-) is said to be partially 
ordered, in notation (X,F,+,-, =), if it obeys the following 
axioms. 


(Al) F=R. 
(A2) exy>aut+2zxy+zforeachz ec X. 
(A3) 0X a2 => 0X ax foreacha > 0. 


(it) From (7) we see that x <x y <0 ~< y—«. We may say then 
that the partial order on X is entirely determined by X*: = 
{a € X : 0 X x}. The subset X* is called the positive cone of X. O 


7.8 Proposition (Properties of Partially Ordered Vector 
Spaces). Let (X,R, +, -, < ) be a partially ordered vector space 
and let A, B C X. Then the following properties hold. 


(i) Ifa>6, then —x <0. 
(ii) Ifa > 0 anda < 0, then ax ~ 0. 
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iti) For AC X anda € X, sup(A+ 2) = supA+z. 


( 
(iv) inf(A+2) =infA+zc. 

(v) sup( — A) = — inf(A) and inf( — A) = — supA. 
( 

( 


vi) For any a € R,,asupA = sup(aA). 


vii) sup(A + B) = supA + supB. 
Proof. 
(i) Indeed, by Axiom 2, 
—-“+2r04(-2) S0> —z. 
(iz) Indeed, 
a@ = (-—1)(—a) and since — a > 0, by Axiom 3, 
— az > 0. But then, by Property (i), (— 1)(— ax) x 0. 


(iti) Indeed, let U(A) = {y € X : y = A} and up = supA. Ob- 
viously, for each a € A and u € U(A), a X up X u. By Axiom 2, 
then 


a+xex2x~ugturnute. 
From the second relation, we have 

A+azx~utuxU(At+z). 
Therefore, up + « = sup(A+ 2). 


(iv) Indeed, let us denote the set L(A) = {ye X:y X A} of 
all lower bounds of A and let /,, =infA. For each a € A and 
| € L(A), we have that 1 < 1, x a. By Axiom 2, 


l+axxl,+tux~at+ez 
or in the form 
L(iA+ a2) nXlyotaxAt+e 


thereby implying that /,, + « = inf(A +2). 


(v) Let L(A) and U(A) be the sets of all lower and upper 
bounds of A and denote /,, = infA and up = supA. Then, we have 
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L(A) Xl XA XU XU(A). 


By Property (7), 


=> —L£(A)=U(- A). 
Similarly, — U/(A) = £( — A). So, we have 
LAS =m SHAS Higa =A), 
Therefore, — wo = inf( — A) and — 1, = sup( — A). 
(vi) Indeed, if up = supA X U(A), we have 
AxXwup <U(A) or axXup Xu, foranya € Aju Ec U(A). 


Thus, by Axiom 3, aA 5 aa X aup X au € U(aA) and it yields 
that auy = sup(aA). 
(vii) Using property (777) we have 
supA + supB = sup(A + supB) 
= sup{a + supB: a € A} 
= sup{sup{a +b:be€ Bh: ae A} 
= sup{a+b:a€ Aandbe€ B} =sup(A+ B). O 


7.9 Definition. 


(i) Recall [Definition 4.7 (vi)] that a partially ordered set 
(X, x ) isa lattice if given any two-element subset {x, y} CX, the 
supremum x Vy and infimum x / y are also elements of X. A 
partially ordered vector space (X,R, +,-, < ), which is also a 
lattice, is called a vector lattice or Riesz space. 
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(ii) Let X be a semivector space over the semifield R, which 
is a lattice. Then, the space (X,Ri, +,-, x) is called a semi- 
vector lattice. 


(iit) Let X be a vector lattice. Then V x € X, define 


x*: =x V 6 (positive part) 

x: =(-—2) V 0 (negative part) 

|x|: = x V (—x) (modulus). O 
7.10 Properties of Vector Lattices. 


(i) («Vy)+z2=(e+2)V(y+z). This follows directly 
from Property 7.8 (777). 


(it) (c@Ay)+z=(a@+2)A(y+z). This follows directly 
from Property 7.8(iv). 


(iti) « = xt — ax”. Indeed, by (i), 
(c V0) —ax2=sup((e—2),0d-z)=(-2z)VO=a2. 
(iv) at + a7 = |z|. Indeed, by (ci2), 
ope =o+e +e =o4+20- 


= 2|(— x) V 6] + x [by Property 7.8(vi)] 
=(-2¢V0)+a= -2Vr= |e. O 
7.11 Examples. 
(i) {0} is a subspace, since by Property 7.6(izi), a- 0 = 0. 


(ii) Any field is a vector space over itself. 


id 


(iz) IR" is a real vector space with 0 = (0,...,0) over 


(iv) I’ space, with all real sequences over the field R whose 
series are absolutely convergent, is a vector space. 


(v) 1? space over the field C, of all sequences such that for each 
@ = (21, 7);...) el, 24 |eal < 00; where pe |1,c0); is 4 
vector space. (See Problem 7.11.) 
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(vi) Cjas) space of all continuous functions on [a,b] is a real 
vector space. 

(vit) Ci, 4) Space of all n-times differentiable functions on [a, 5] is 
a real vector space. 

(viii) C™ space of all infinitely differentiable functions is a 
complex vector space. 

(ix) In Example 7.4 (x), (I?\I} U {0}, +,*), where 6 = (0,0, 

..), iS a field, because elements of PAL have multiplicative 

inverses. (C, + , - ) is another example of a field. 


(x) The space R* of all real-valued functions on a set X is a 
commutative algebra over R with unity. R* is also a vector lattice. 
Indeed, firstly, (R*, +, -) is a vector space over the field R. 
Secondly, a common partial order on R* can be defined as f = g if 
and only if f(x) < g(x) for all « € X. Furthermore, R* is a lattice. 
For example, we set f V g = {max{ f(x), g(x)}: a2 € X}. 


(xi) The subspace F,(X;IR) C R* of all bounded real-valued 
functions on a set X is a commutative subalgebra with unity and a 
vector lattice. 


(xii) The subspace C(X; R) of all continuous functions is also a 
commutative subalgebra over R with unity and a vector lattice. 


(xiii) The subspace C,(X;R) of all bounded continuous 
functions is a commutative subalgebra of C(X;IR) and a vector 
lattice. 


(xiv) The subspace C”(R; R) of all n-times differentiable func- 
tions (likewise, the subspace C)(IR;R) of n-times continuously 
differentiable functions) is a commutative subalgebra with unity but 
not a lattice (sup{x, —x} = |a| ¢ C"(R,R)). 

(xv) The space C*(C; C) of all infinitely differentiable functions 
over C is a commutative algebra with unity but not a lattice. 


wz 


(xvi) The space & of all polynomials with real coefficients is 
a commutative subalgebra over R with unity but not a lattice. 
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(xvii) The space Q of all polynomials with rational coefficients 
is a commutative subalgebra over the field of rational numbers with 
unity but not a lattice. O 
PROBLEMS 


7.1 Show that each monoid has exactly one identity. 


7.2. Let (G,*) be a group. Show that for each two elements 
x,y €G, there arel,r € G, such that!xx2 = yandz*r=y. 


7.3. An operation « is called reducible if x * y = x *« z implies that 
y = z for all x,y, z. Show that if (G,*) is a group, then * is 
reducible. In particular, show that for each x € G, its inverse is 
unique. 


7.4 Prove Theorem 7.2. 


7.5 Let [G,G, f] be an isomorphism. Show that [G,G, f—'] is also 
an isomorphism. 


7.6 Show that in Definition 7.3, 1 € Kerf. 


7.7 Let [G,G, f] be an isomorphism. Find Kerf. 


7.8 Let [G,G, f] be a mapping such that G = G = R with operation 
+ and let f(x) = [a] (i.e., the greatest integer less than or 
equal to x). Is |G, G, f] an endomorphism? 


7.9 Let G be the set of all 2 x 2 real matrices whose determinants 
equal 1. 


a) Show that (G, - ) (with the usual matrix multiplication) is 
a group. 


80 
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7.14 
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b) Let B be any 2 x 2 nonsingular matrix. Define the map 
[G,G, f] such that f(A) = B-'AB. Show that [G,G, f] 
is an automorphism. 


Show that Va,b >0 and p€ [1,00), (a+b)? < 2? '(aP + 
b?). [Hint: For p> 1, work with the auxiliary function f(z) 
=(a+a)?— 21 (a? +2”), «> 01] 


Show that /? (defined in Example 7.11 (v)) is a vector space 
over C; specifically show that x,y El? > x+yeEl?. 


Show that the operation « in Example 7.4 (x) is commutative 
and associative and it is closed in /', that is, show that if 
x,y El, thenxxy el’. 


Is (J, +, ©) [where (7, ©) is defined in Example 7.4 (viiz)] 
a ring? 


Let S be a subset of C. Argue for what cases S is a subspace of 
C over R: 


a) S is a closed unit disc centered at zero; that is, S = 
{zEC: |z| < 1}. 


b) S = {z € C: {|Re(z)| < 1} x {|Im(z)| < 1}}. 
c) S = {z € C: {Im(z) = 0} x {|Im(z)| < 1}}. 


d) S = {z €C: Im(z) > 0 and Re(z) > O} U{zEC: 
Im(z) < 0 and Re(z) < O}. 


In the condition of Example 7.4(x), let x = (x, 21,...)€ 0 
such that 7) ~ 0. Show that for each x with x # 0, there is a 
unique element x~! with respect to * such that x*x-! = 1. Is 
x! the two-sided inverse of x? If this is the case, show that 
(U'\1}, *) is an Abelian group, where /j denotes the subset of 
all elements x € /' with 2) = 0. Is the triple (/', +, *) an 
Abelian ring with unity? 
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7.16 Let (X,R, +,-, x ) bea Riesz space and let A, B C X such 
that infA, infB, inf(A+ B) € X. Prove the following equa- 
tions. 

(i) inf(A+B) = infA + infB. 
(it) Foranya > 0, ainfA = inf(aA). 


(iti) For any a < 0, the following hold: asupA = inf(aA) 
and ainfA = sup(aA). 


7.17 Show the validity of the following equations in a Riesz space 
(X, R, a gc =). 


i) WaeVy)=at+ytlc—yl. 
it) WaeAy)=ax+y-—|x—-yl. 


iti) c+ty=(a£Vy)t+(rAy). 


oO) 2 har =e, 


vi) Jaa] = |al|e]. 


( 
( 
( 
(w) at Va =(|gl. 
( 
( 
( 


vii) |x + y| + a — yl = 2([a] V [yl). 


7.18 Let (X,R, +,-, <) be a Riesz space. Prove the following 
inequalities. 


(i) Wesay that A <uifu€U(A). Let AX uand B xu, 
then sup(A U B) X u. 


(ai) Wax gandy < 2, thenz Vy ~ z. 


(iti) Ifa X zand — x X z, then |z| ~ z. 
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(iv) The inequality — z X x X z is equivalent to |x| ~ z. 


(v) The triangle inequality: |” + y| x |a| + |y]. 


za 


7.19 Let (X,R, +,-, <) bea Riesz space. If  € X anda € 
prove the following: 


(i) (ax)* =ax* and(axr) =azx ifa>0. 
(it) (ax) = —ax and (ar) = —aa* ifa <0. 
(iti) (ax)" — (ax) = az, foralla ER. 


7.20 Show that the positive cone X~* has the following properties: 
X*(— Xt) = {6}. 
XtT+ Xt CX, 
aXt C Xt, foralla > 0. 
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8. MORE ABOUT VECTOR SPACES 


In this section we study vector spaces more systematically in re- 
sponse to the needs of forthcoming topics on normed vector spaces 
(Chapter 2) and Hilbert spaces [Dsh2]. Most of these materials can 
be found in linear algebra textbooks. 


8.1 Definitions. 


(i) Let V be a vector space over a field F. Let {v,,...,u,} bea 
finite subset of V and let {a1,..., an} C F. The vector 


QyVy +e Ann € V 


is called a linear combination of {v1,..., Un}. If a linear combina- 
tion a1U, +--+ Q,Un = 6 and it implies that ay = ag =--- = Ay 
= 0, then the set {v,,..., vn} C V is said to be linearly indepen- 
dent. We set by definition the empty set @ as linearly independent. If 
a finite subset {v1,..., Un} C V is not linearly independent then it is 
called linearly dependent. An arbitrary subset A of V is called Ji- 
nearly independent if each of its finite subsets is linearly independ- 
ent. If A has at least one finite linearly dependent subset, then A is 
said to be linearly dependent. 


(it) The intersection of all subspaces of a vector space VY that 
contain a certain subset A C VY is called the /inear span (or linear 
hull) of A and it is denoted by span A. 


Observe that a linear span is a subspace of V. Indeed, if 
x,y € spanA, then x,y are elements of all subspaces containing A. 
Obviously, ax + Gy is an element of all subspaces containing A and 
thus it belongs to spanA. All other properties of spanA as a vector 
space can be shown similarly. Obviously, a linear span is the mini- 
mal subspace containing set A. We refer to spanA as the subspace 
generated (or spanned) by A. Any subspace S is a trivial example of 
its linear span, that is, spanS = S. 

(iit) A subset H C VY is called a Hamel basis (or just basis) for 
V if 

(a) H is linearly independent, 
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(b) spanH = V. O 
8.2 Remarks. Let V be a vector space. 


(i) It is clear that span@ = {0}, because {6} is the minimal sub- 
space containing the empty set 0. 


(it) If A @, then spanA equals the set Ao of all linear combi- 
nations of elements of A. Indeed, Ag is a subspace containing A and 
therefore, spanA C Ao. Let vg € Ag. Then vp = ayX%, +--+ + AnFn 
such that 2; € A C spanA. On the other hand, because spanA is a 
subspace itself and because x; € spanA, it must contain vp. 
Therefore, Ag C spanA implying that Ag = spanA. 


(iii) If H is a Hamel basis for V, then Remark (77) implies that 
each vector of V can be represented as a linear combination of 
elements of H. We can also convert this statement; namely, if H is a 
subset of V such that any vector of VY is a linear combination of 
elements of H, then H is a Hamel basis. It is easy to see that any 
such representation is unique relative to one particular Hamel basis. 
Indeed, let v € V and suppose we have two representations of v such 
thate = $7 je;hs = S04 beli.. Then, 


yes + Dyes (— Behe = 0. 


If h;'s and h;,'s are all distinct, then due to linear independence of H, 
a; = 0; = 0. Even if for some i and k, h; = hi,'s, again by linear in- 
dependence of H it follows that a; = (x. O 


8.3 Example. Let V = C": = C x Cx --: x C (n times) be the 
n-dimensional complex space. It is a vector space over the field C. 
Denote by H:= {e1...,en} the set of unit vectors e;: = 
(0,...,0,1,0,...,0) where 1 is the ith entry. This set is linearly in- 
dependent. Let ¢ € C” be an arbitrary vector with coordinates 
(C1,---,Gn) where ¢; = x; + iy;. Then, by choosing a; = ¢; we can 
represent ¢ as ¢ = a1 85eF- Therefore, H is a Hamel basis for C”. 
Observe that H is not a unique Hamel basis for VY. For example, in 
the above definition of e; we can replace the 7th element 1 with — 1 
and do so with some other e,'s . O 
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8.4 Theorem. Any two Hamel bases in a vector space V have the 
same cardinality. O 


For finite Hamel bases, the proof to Theorem 8.4 is available in 
most linear algebra text books. For the general case, a proof can be 
found in [Hew], Theorem 4.58, p. 30. 


8.5 Remark. As we have seen, a vector space can have more 
than one Hamel base. However, it can be proved that all of them 
have the same cardinal number [card(/7)]. In other words, if H; and 
Hy are two Hamel bases for VY, then there is a bijective map f : 
Ht, — Hp, that is, H, and Ho are equivalent sets. Specifically, if H, 
is finite, then 7, is also finite and they have the same number of ele- 
ments, that is, card(H,) = card(H2). [Recall that Xo: = card(N) and 
c: = card(R) are the most frequently used infinite cardinals.] Thus, 
each vector space V is characterized by its “dimension,” in notation 
dimY, that is by the cardinal number of one of its Hamel bases. 
Correspondingly, we distinguish finite- and infinite-dimensional 
vector spaces. For instance, in the above example, dimC” =n. O 


8.6 Definitions. 


(i) Let V and W be vector spaces over a field F. An f: V 
— W is called a linear map (or linear operator) if for all x,y € V 
anda € F, 


(a) f(xt+y) = f(x) + f(y) (where + is the operation of 
addition in W) 


(6) fla-«)=a- f(a). 


(ii) A linear map f:V — F (where VY is a vector space over 
field F) is called a linear functional. 


(iit) Replacing field F in (2) and (iz) by a semifield F,, we have 
the notions of a semilinear map and a semilinear functional, res- 
pectively. 


(iv) Let V and W be vector spaces over a field F. If the linear 
map in (7) is bijective, then we call it an isomorphism. [For a more 
general notion of an isomorphism see Definition 7.1(v).] The vector 
spaces Y and W are called isomorphic. O 
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8.7 Remarks. 


(i) Property (a) in Definition 8.6 (i) makes a linear map 
[V, W, f] a homomorphism from (V, + ) into (W, +). By Theorem 
125 


VaeV, f(—x) = — f(x) and 
f(y) = Ow, 


which we arrived at without the utility of property (0). 


(it) According to Definition 7.3, f*(@wy) = Kerf (often referred 
to as the null space of f). By Problem 7.6, 0) € Kerf. O 


8.8 Proposition. /f[V, WV, f| is a linear map, then 

(i) the subset Kerf C V is a linear manifold. 

(it) the subset f,(V) C W is a linear manifold. O 
(See Problems 8.1 and 8.2.) 

8.9 Examples. 


(i) Let I. be the set of all real-valued sequences with all but 
finitely many zero elements (called the sequences with finite sup- 
port). |, is a vector space over R with the usual operations of addi- 
tion and multiplication by scalars. Clearly, |. is isomorphic to the 
space #of all polynomials with real coefficients. 


(it) If V is a vector space over a field F, with a finite Hamel 
basis H = {hy,...,h,}, then each vector v € V can be uniquely re- 
presented as v=ayh, +aghg+---+anh,. Hence, each vector 
v€YV can be associated with a unique vector f(v) = (a1,..., 
Q,) € F” and conversely, each vector (aj,...,Qn) € F” can be 
associated with a unique vector f~'(ay,...,@,) = v € V. Observe 
that F” is a vector space over F. In addition, ax + By = (aa, + 
BB )hy +--+ + (AQ, + GBn)hn, which implies that V and F” are 
isomorphic. This is why we often prefer to use the term “vector 
space” over the term “linear space.” O 


Specifically, if F = C, the above considerations can be sum- 
marized as the following proposition. 
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8.10 Proposition. Any n-dimensional vector space over the field 
C is isomorphic to C”. O 


The following theorem establishes the existence of a Hamel basis 
in a vector space. 


8.11 Theorem. Each vector space VY with two or more elements 
over a field F has a Hamel basis. 


Proof. Let F be the family of all linearly independent subsets in 
Y. Clearly, F # © due to our assumption that card VY > 2. Obvious- 
ly, F satisfies Tukey's Lemma 4.14 according to which F has at 
least one maximal element in the sense of partial order C . O 


Now, with the aid of Theorem 8.11 we can get a very useful 
criterion for Hamel bases. 


8.12 Theorem. H is a Hamel basis for a vector space V over a 
field F ifand only if H is a maximal linearly independent set. 


Proof. (i) Let H be a Hamel basis and let x € Y such that x # 0. 
Then, x can be represented as a finite linear combination 


c= Dy yaih; (a; € F, A; € H). 


Thus, {x,h1,...,h,} is a linearly dependent subset, while H U {x} 
is linearly dependent, evidently for any x € Y\H. Therefore, H is 
indeed a maximal linearly independent set in V. 


(ii) Now suppose H is a maximal linearly independent set in V. 
We will prove that H is a Hamel basis. Let 6 4 hy € V\H. Then, 
H U {ho} is linearly dependent. Therefore, there is a linear combina- 
tion nes = @ such that h,,...,h, € H and that not all a's are 
zeros. If ag = 0, then h,,...,h, are linearly dependent, which is im- 
possible. Thus, 


Here not all a; = 0, because hy # 6 by our assumption. If hyo € A, 


then ho = 1 - ho. Consequently, hy € spanH and thus VY C spanH. 
So, H is a Hamel basis. i 
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And one more useful result. 


8.13 Proposition. Let A be a linearly independent subset of a 
vector space X. Then, there is a Hamel basis H for X such that 
ACH. 


Proof. If there is no Hamel basis H for X such that A C H, 
then set A is a maximal linearly independent set in X and thus is a 
Hamel basis itself. O 


PROBLEMS 


8.1 Prove the statement in Proposition 8.8(i) that the subset 
Kerf C V is a linear manifold. 


8.2 Prove the statement in Proposition 8.8(ii) that the subset 
f«(V) © W is a linear manifold. 


8.3. Let X bea finite-dimensional vector space and let Y C X bea 
linear manifold. Show that dim Y < dim_X. 


8.4 Let X bea finite-dimensional vector space and let Y C X bea 
linear manifold such that dim Y = dim X. Show that Y = X. 


8.5 Let X be a vector space and A, B C X. Define A+ B= {x 
+y:x€A,y © B}. Prove that if A and B are linear mani- 
folds, then A + B is a linear manifold. 


8.6 Let X be a vector space and let Y, Z C X be two linear mani- 
folds. Prove the statement. For each x € Y + Z, there is a 
unique decomposition of x by vectors y © Y and z € Z such 
that c=y+z if and only if YNZ={6}. (Two such 
manifolds are said to be /inearly disjoint.) 


8.7 Let X be a finite-dimensional vector space and Y, Z C X be 
two linearly disjoint manifolds. (See Problem 8.6.) Then, 
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Y + Z 1s called a direct sum of Y and Z and it is denoted by 
Y @ Z. Prove that if X =Y @Z, then dim X = dimY + 
dim Z. 


8.8 Show that the set Cre] of all infinitely differentiable functions 


on [a, 7] is not finite-dimensional. 


8.9 Show that the differential operator o Chay — Crap] is linear 


with respect to R. 


8.10 Let A be an n xX m matrix. Show that A: R™ — R” is a linear 
operator with respect to R. 
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Chapter 2 
Analysis of Metric Spaces 


Metric spaces were introduced and studied by the French mathemati- 
cian, Maurice René Fréchet (in his doctoral dissertation published in 
1906), and developed later by the German Felix Hausdorff (in his 
1914 book Grundztge der Mengenlehre). It was apparent that to the 
end of the nineteenth century the mathematical world (partly inspired 
by Cantor's fundamental work in set theory) was eager to structure 
more general sets than conventional IR”. On the other hand, the needs 
of complex analysis and the rash development of differential equa- 
tions accelerated this process. Typical examples are uniform conver- 
gence in function spaces, approximation of continuous functions by 
polynomials and the Riemann mapping theorem. After 1920, the 
theory of metric spaces, especially fundamental work on normed 
spaces and their applications to functional analysis, was further de- 
veloped by the Pole Stefan Banach and his school. Paying tribute to 
their achievements and of their fellow countrymen followers, an im- 
portant subclass of metric spaces was named “Polish.” A series of 
studies of metric spaces was further undertaken in the late 1920s by 
the Russian school of analysis. At this time, metric spaces have 
become generalized to topological spaces. 


In this chapter we introduce the main principles of metric spaces 
and their special case: normed vector spaces. This part of analysis 
traditionally precedes the more general theory of topology and 
functional analysis. 


J.H. Dshalalow, Foundations of Abstract Analysis, 93 
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1. DEFINITIONS AND NOTATIONS 


The concept of “metric” (measuring distances in space) is at the root 
of mathematical (geometric) thinking. Starting with that concept we 
show how the concepts of limits of sequences and continuity of 
functions can be extended by metrization of spaces more general 
than Euclidean spaces introduced in calculus. Recall that a point x is 
a limit of a sequence {x,,} if all terms of the sequence numbered 
with k,k+1,... for some k are sufficiently “close” to x. The 
closeness of these points to x was defined in terms of the Euclidean 
distance |x — x;,|, which determined the specific structure employed 
on R. In many applications, an underlying carrier is often more 
general than R or even R”. So, the question arises, “how do we build 
the analysis in the general space?” Since the distance was crucial in 
the formation of analysis on the real line, we therefore introduce this 
notion also for the general space, emphasizing the main properties of 
the distance we have had experience with. Once a distance (or 
metric) between any two points of a given set is defined, the set be- 
comes “well-structured” or metrized, and then is ranked as a space, 
more precisely, a metric space. 


1.1 Definitions. 


(i) Let X be a nonempty set. A metric d (or distance) on X is 
any nonnegative function d: X* — R, such that: 


(a) Va,yEX, d(a,y)=0 S r=y. 

(6) Va,ye X, d(z,y) =dly, 2). 

(c) Va,y,z € X,d(x,y) < d(a, z)+d(z, y) (triangle 
inequality). 


The pair (X,d) is called a metric space. As usual, we refer to set X 
as a carrier. Sometimes, for brevity, carrier X itself is called the 
metric space. 


(2) If for z,ye X, x =y implies d(x,y) = 0, but the con- 
verse does not hold [i.e., d(x, y) = 0 does not yield x = y], and (6b) 
and (c) hold, then d is called a pseudo-metric. Correspondingly, the 
pair (X, d) is called a pseudo-metric space. 
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Any pseudo-metric can be “made” a metric by introducing the 
equivalence classes generated by d in such a way that x and y will 
belong to one and the same class whenever d(x, y) = 0. The corres- 
ponding metric space will then turn to (X|Eqz,d), where X|E¥, is the 
quotient set of X modulo Eg and d is the reducer or d(as per 
Theorem 4.4, Chapter 1). O 


1.2 Remark. By the triangle inequality we have 
d(x, y) — d(zy) < d(a, z), (1.2a) 


which holds for all x,y,z € X. Then, interchanging x and z in the 
last inequality we arrive at 


Inequalities (1.2a) and (1.2b) yield 
|d(x, y) — d(y, z)| < d(x, z), Wau eS A. (1.2c) 


oI 


Let Y C X. Then the pair (Y, d) is also a metric space, called a 
subspace of (X,d). 


1.3 Examples (of metric spaces). 
(i) The discrete metric is defined on a nonempty set X as 


uen={q 224 


L=%7, 


The triangle inequality does not hold if and only if d(x, y) = 1 and 
d(x, z) = d(z, y) = 0. However, the latter can only be possible when 
x= z=y. Hence, d(x, y) cannot equal 1. 


(ii) Let X = (0,00) and d(z,y) = |4 — 7 . The triangle in- 
equality follows from 
ast. 4a) = |e A | 
d(x,y) = x 1 = E z + z 1| 
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14 
= 1| = d(z,z)+d(z,y). 


SF aes 


(iti) Let X consist of all sequences {x,,} C R. Such a carrier X 
is denoted by RN. Recall that a subset of R™ is the I' space if it 
contains only absolutely convergent sequences, 1.e., those with 


Dini |Z, | < oo. 


Let us define the function d on I! as d(x,y) = 0°, | Un —Yn | - 
Then 
oe) 
y) =>, | Bn — Zn + Zn — Yn | 


n=1 


n=1 


ae, 
(ove) ove) 
< > | tn — Zn | +55 | Zn — Yn | 
n=1 


= d(x, z) + d(z,y). 
Thus, d is a metric on /', because the other properties of d as a 


metric are obvious. 
(iv) Let Cj,») denote the set of all continuous functions on 


R. Let us define 


interval [a,b] C 
doo(a, y) = sup{|x(t) — y(t)| : t € [a, bf}, 


called the supremum metric. Because any continuous function on a 
closed and bounded interval assumes maximum and minimum 


values, the definition of d makes sense. The inequalities 
Jz(t) — y(4)| < |x(t) — 2(t)| + z(t) — y®)| 
< sup|a(t) — 2(t)| + sup|z(t) — y(t)| 


hold for all t € [a, b]. Therefore we have 


sup|ar(t) — y(t)| < sup|ar(t) — z(t)| + sup|2(t) — y()], 
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which is exactly the triangle inequality. Property (a) of a metric 
obviously holds. Hence d is a metric on Cy, 4). 


(v) Now, define another metric on Cj, 9): 


d(a,y) = fi la(t) — y(t)|dt. 


It is easy to see that d(x, y) = 0 if and only if x(t) = y(t) for all 
t € [a, b]. (See Problem 1.11). The triangle inequality is obvious. O 


PROBLEMS 


11 Let X =R and d(z,y) =sin?(x —y). Is (X,d) a metric 
space? 


1.2 Let X =Randd(z,y) = \/|x — y|. Is (X, d) a metric space? 


13 Let X =R". Define on X, d(x, y) = max{|z, — yz|: k= 
Lisvagtt} Ve = (Gig 0324); B= Wigsves Ga). Show that 
(X,d,,) is a metric space. 


1.4 Let d bea metric on X. Define 


d(a, 
p(@,¥) = Tea - 


Show that p isa metric on X. 


1.5 Two real numbers p>1 and q>1 are called conjugate 
exponents, if 


Show that for all x, y € R, and for conjugate exponents p and 


q, the following inequality holds. 


98 


1.6 


17 


1.8 
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Hint: Work with the function f(z) = < + ; —z!/P and then 
a? 


substitutez = =. 
y 


(a) Prove Hdélder's inequality for finite sums: for conjugate 
exponents p > 1 and q > | such that . + i oe a eee ee 
0: and 044.0450, 2 Uy 


1/q 


n n 1/pT n 
dr aidi < ba bu 
i=1 4=1 i=1 


Hint: Apply Problem 1.5 to x = a;/A and y = b;/B, where 


1/q 


n 1/p n 
A= ba and B= bul 
i=l i=1 


(b) Generalize Hélder's inequality for infinite sums. 


a) Prove Minkowski's inequality (for finite sums): for p > 1, 
Q1,---,QAn > O, and bj,...,b, > O, it holds true that 


n 1/p n 1/p n 1/p 
i=1 4=1 w= 


(Hint: Make use of (a + b)? = a(a + b)?-' + b(a +b)? | and 
then apply Hélder's inequality.) 


b) Generalize Minkowski's inequality for infinite sums. 


The Euclidean metric or Euclidean distance is defined in R"” 


by 
de(x,y) = be (te=de) (P1.8) 


t= (op song ah Y= (Uiaiees la) (P1.8a) 
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1.9 


1.10 


1.11 


[Specifically, if nm=1, we have d(z,y) = /(a#-—y)? = 
|c —y|.]| Show that d. is indeed a metric (Hint: Apply 
Minkowski's inequality.) 


In Problem 1.8 we defined the Euclidean metric on R” by 
equation (P1.8). This metric can be regarded as 


dp(x,y) = 4] D0 de(is ye)? (P1.9) 
k=1 
@ = (21,...,2n), Y = (Yt, ---5 Yn), (P1.9a) 


where d;,(xz, yx) 1s the one-dimensional Euclidean metric on 
the kth coordinate axis (Ath factor space). We can extend this 
notion and define a metric on the n-times Cartesian product set 
Y=Y,x.---x Y, by formulas (P1.9-P1.9a). This problem 
asserts that such d, is indeed a metric on Y. We call this met- 
ric the product metric and the corresponding metric space 
(Y,dp) the product space. In notation, x {(Y¥%, dx): k= 1, 
song fis 


Prove the statement: Let (Y;, d,), k=1,..., , bea collec- 
tion of metric spaces and let Y be the Cartesian product of 
Y1,.-., Yn. Then the function dp on Y x Y defined by (P1.9- 
P1.9a) is a metric on Y. 


Show that the function p(x,y) = \7_,dg(rx, yx) is also a 
metnc-on Y= ¥; x Yo x ++ * ¥,< 


In Example 1.3 (v), why is d(x, y) = 0 if and only if x = y? 
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2. THE STRUCTURE OF METRIC SPACES 


The structural properties of metric spaces stem from the notion of the 
open ball, with the aid of which we are be able to introduce open and 
closed sets, interior, closure, and accumulation points. Open balls, 
due to a particular metric, generate convergence and continuity, the 
principles of any analysis, which we explore in this chapter and 
Chapter 3. 


2.1 Definition. Let (X,d) be a metric space and let x € X and 
r > 0. The subset of X, 


B(z,r) = {ye X: d(a,y) <r}, 


is called the open ball centered at x with radius r (with respect to 
metric d). [Whenever we need to emphasize that the ball is with res- 
pect to metric d, we write it as By(x, 71). This notation makes sense if 
X is endowed with more than one metric. | O 


2.2 Examples. 

(i) The open ball B(x,7r) in Euclidean space (R,d-.) is the 
open interval (x —r, e+ 7). 

(ii) The open ball B(x,r) in Euclidean space (R?,d.) is the 
open disc centered at x with radius r in the usual sense. 


(iii) Different metric choices on a given carrier give rise to diffe- 
rent spaces and, as the result, to different shapes of open balls. 
Indeed, in metric spaces other than Euclidean, the shape of open 
balls may be quite surprising to our usual perception of them. 
Consider, for instance, an open ball B(x,r) in (R?,d), where d is 
the supremum metric defined as in Problem 1.3, for n = 2: 


doo (x,y) = max{|x1 — yi], |Z2 — yo|}. 


It is easy to see that the open ball B,.(x, 1) is square shaped and that 
the Euclidean ball B.(x,1) is inscribed in this square (see Figure 2.1 
below). 
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(iv) Let (X, d) be a discrete metric space with the metric defined 
in Example 1.3(7). Then, for any « € X, an open ball centered at x is 


ie}, el 
Ben =) r>1. 


Indeed, because d(x, y) is either 0 or 1, for r <1, B(x,r) = {y € 
X:0=d(a,y) <1}, whereas for r>1, B(x,r)={yEexXx: 
d(x, y) = 1}. 

(v) Let (X,d) be the metric space defined in Example 1.3(iv), 
where X = C;,5), and 


doo(x, y) = sup{|x(t) — y(t)|: t € [a, bf. 
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Then the open ball B.,(x,7) has a shape as depicted in Figure 2.2 
below. 


x(t) +r 


x(t) 
y(t) 
x(t)-r 


Figure 2.2 
O 


2.3 Definition. Let (X,d) be a metric space. A subset A of X is 
called a d-open set (or just open set) if every point x of A can serve 
as the center of an open ball inscribed in A, that is, there is an r > 0 
such that B(x, r) C A. O 


2.4 Examples. 


(i) Every open ball is an open set itself. Indeed, if x, € B(x,r) 
then r—d(x,21) >0. Take r; =r—d(ax,21) and show that 
B(a1,7T1) C B(a,r). For every z € B(x,r1), by the triangle in- 
equality, 


d(a,z) < d(a,01)+d(ai, 2) < d(z,a,) +71 =r. 


Thus z € B(x,r) (see Figure 2.3 below). 
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(it) The set [a,b), for a <b, in (R,d_.) is not open, because 
there is no open ball B(a,r) C |a, b). 


(iii) Any carrier X is obviously open. 


Figure 2.3 


(iv) A set A is not open if there is at least one point x € A such 
that there is no ball B(x, 7) that can be inscribed in A. The empty set 
does not have any point. Thus it is reasonable to assign it to the class 
of open sets. In other words, we define @ as an open set. 


(v) In the Euclidean space (R,d_.), IR is an open set but not an 
open ball (why?). O 


2.5 Theorem. For every metric space (X,d), the following 
statements hold true. 


(i) Arbitrary unions of open sets are open sets. 


(ii) Finite intersections of open sets are open sets. 
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Proof. 


(i) Let {A,: k € I} be an indexed family of open sets in X 


and let A=\) A;,. If  € A, then there is an i such that x € A;. 
kel 


Because A; is open, there is an r > O such that 


B(a,r) CA; CUA. 
kel 

Therefore, A is open. 

(it) Let A;,..., An be open subsets of X and let A = ()] A;. If 

k=1 

x € Athen xz € Ay, k= 1,...,n. It follows that there are r1,..., rn 
such that B(e,r,)CAp, b= 1,...,m. Let r= min ry,...T} 
Then, obviously, B(x,r) A@ and B(a,r) C Ay, k=1,...,n. 
Thus, B(x,r) C A and A is open. CO 


2.6 Remark. The intersection of more than finitely many open 
sets need not be open. The reason is that because r = min{r;, : 
k, € I} can be zero. For example, let 

A, =(1— 4,14 +) in(R,d.). 


n? 


Then 1€ A,, n=1,2,..., which implies that 1 © () A, and 


n=1 


hence {1} = {) A, . However, the set {1} is not open in (R,d.). O 
n=1 

2.7 Example. Let (X,d) be a discrete metric space. Then the 
power set P(X) coincides with the set of all open sets. Indeed, in 
Example 2.2 (iv), we showed that in any discrete metric space, every 
singleton {x} and the carrier X are open balls. In addition, @ is an 
open set. Any subset A of X can be represented as the union of all 
points of A. Thus by Theorem 2.5(2), it follows that A is also open. 
Specifically, in R endowed with the discrete metric, all singletons are 
open, whereas in Euclidean space (R, d.) they are not. C 


2.8 Definitions. 


(i) A point zc € AC X is called an interior point of A if there 
exists an open ball B(x,r) C A. The set of all interior points of set 


A is denoted by Aor Int(A) and called the interior of A. 
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(Clearly, Ais the largest open subset of A, which yields that A is 
open if and only if A = A. Indeed, let C C A be an open set, larger 


fe} fe} 
than A. Then, there is an x € C such that x ¢ A. But this is a 
contradiction, because x must be an interior point of A.) 


(ii) A subset A of X is called closed if its complement A° is 
open. [Specifically, the carrier X and the empty set © are both 
closed. Any singleton is obviously closed. ] O 


2.9 Proposition. Arbitrary intersections or finite unions of 
closed sets are closed sets. 


Proof. The statements follow from Theorem 2.5 by applying De 
Morgan's laws. O 


2.10 Example. Because the set of all open subsets of a discrete 
metric space (X,d) coincides with its power set, the set of all closed 
subsets is also the power set. Particularly, in a discrete metric space 
all subsets are simultaneously open and closed. O 


2.11 Definitions. 


(i) A point « € X is called a closure point of A C X if every 
open ball centered at x contains at least one element of A. We also 
say, “if every open ball centered at x meets A.” The set of all closure 
points of A is denoted by A or by C/(A) and called the closure of A. 
[From Definition 2.8 (2) it follows that A C A.] 


(it) There are two types of closures points. Type 1: x is an 
isolated closure point of A if there is a ball B(x,r) that does not 
contain any points of A, other than x. In other words, B(x,r)N 
(A\{x}) = @. Clearly, an isolated closure point must belong to A. 
Type 2: x is an accumulation point, if it is not an isolated point. That 
means, for each r > 0, B(a,r) M(A\{ax}) # G. Unlike an isolated 
closure point, an accumulation point need not belong to A. 


The set of all accumulation points of A is called a derived set, in 
notation A’. From the above definition it follows that A’ C A. 


(iit) A point x € X is called a boundary point of A if every open 
ball at x contains points from A and from A°. The set of all boundary 
points of A is called the boundary of A and is denoted by OA. O 
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2.12 Examples. 


(i) In (R,d.), let A = (0,2) U {5}. {5} is an isolated closure 
point, because B(5,r) contains only {5} for all r < 3. Thus, A= 
[0,2] U{5}, and A’ = (0, 2). 


(ii) Note that 0A = 0A° = AN A®. (See Problem 2.1.) There- 
fore, 0A (as follows from Proposition 2.13 below) is a closed set. 


(iii) In the next section we learn that the closure Q of all rational 
numbers in (IR,d,) is entire R. So is the closure Q¢ of irrational 
numbers. Consequently, by (iz), OQ = R, which is quite surprising. 
[Note that OQ = R also directly follows from Definition 2.11 (iz7).] 


(iv) The boundaries of X and @ are the empty sets. 


(v) Let A = [0,1)U {2}. Then, A= (0,1), A= [0,1] U {2}, 
A’ = [0,1], OA = {0,1,2} (because A° = (—00,0) U[1,2)U 


(2,00), A® = (—00, 0] U[1, 00), and AN A¢ = {0, 1, 2}). 


(vi) Let A = {1,5,5,---} C (R,d_). Because 0 is the limit of 
the sequence {+} (in terms of Euclidean distance), it is also an 
accumulation point of A. Any open ball at 0 contains at least one 
point of A. This is the only accumulation point of A. By the way, A 
is not closed, for 0 is a closure point of A. So we have A’ = {0}, and 
A= AU {0}. oO 

2.13 Proposition. For any subset A of X, A is a closed set and 
furthermore, A is the smallest closed superset of A. 


Proof. 


(i) We show first that A is a closed set, that is, (C/(A))° is 
open. Let x € (C/(A))°. Then there exists an open ball B(x,r) such 
that B(x,r)N A = @ (because, otherwise, x would belong to A by 
the definition). However, we have not proved yet that B(x,r)N A 
= @, which would immediately imply that (C/(A))° is open. Now 
we show that no point of B(z,r) is a closure point of A. Take an 
arbitrary point t € B(x,r). Because B(x,r) is an open set, there is 
an open ball B(t,r;) C B(x,r) also disjoint from A. By the 
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definition of a closure point, this means that t ¢ A. In as much as t¢ 
was an arbitrary point of B(x,r), B(x,r) C (CI(A))°. 


(ii) Now we show that the closure of A is the smallest closed 
set containing A. Let B be an arbitrary closed set such that A C B. 
We prove that B® C (A)°. Because B° is open, for each x € B®, 
there is an open ball B(x,r) C B°. This implies that B(z,r) NB 
= Mand that B(z,r)NA=@. 


Thus x ¢ A (by the definition of a closure point), which is equi- 
valent to x € (CI(A))°. Therefore, we have proved that x € B° 
yields that x € (CI(A))°, that is, B° C (CI(A))°. The latter is ob- 
viously equivalent to A C B. oO 


2.14 Corollary. A set A is closed if and only if A = A. 
(See Problem 2.15.) 


2.15 Remark. Consider the set C = C(a,r) = {y € X: d(x, y) 
<r}. It can be easily shown that C’ is a closed set. (See Problem 
2.4.) Such a C' is called a closed ball centered at x with radius r. 
Evidently, B(a,r) C C(x,r) implies that the closure of an open ball 
is a subset of C [i.e., B(x,r) C C(x,r)], because B is the smallest 
closed set containing B. However, C'(x,r) need not coincide with 
the closure of the corresponding open ball B(x, 1). For instance, let 
(X,d) be a discrete metric space, where any open ball is both a 
closed and open set: B(x,r) = B(x, r). Because 


Ly, ‘ee | 
cen ={ Y r>1 


we have- BU. 7) =} Cla ¢) =X forr > lor Ber) =Car) = 


{ao} forr <1, Fore =1, Bar) = {a} Cc Ca,7) =X, unless Xx 
is a singleton. O 


2.16 Example. The set of all rational numbers Q is neither open 
nor closed. Indeed, it is known that each irrational point x is a limit 
of a sequence of rational points {z,,}. Therefore, if x is irrational, 
there is no open ball B(x,7) that does not contain rational points. 
This implies that Q° is not open either, or equivalently, Q is not 
closed. On the other hand, Q cannot be open, because otherwise, 
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every rational point g could be the center of an open ball (interval) 
containing just rational numbers. This is absurd, in as much as any 
interval is a continuum. Therefore, the set of all rational numbers is 
neither open nor closed. It also follows that the set of all irrational 
numbers is neither open nor closed. O 


In Problem 1.9, we introduced a product metric. We wonder how 
open sets can look like in a product metric space. A remarkable pro- 
perty of such a metric is given by the following theorem to be proved 
in Chapter 3 for the general topological product spaces. 


2.17 Theorem. Let {(Y;,d;): k =1,...,n} be a finite family of 
metric spaces and let (Y,d) = x {(Yz,dx): k =1,...,n} be the 
product space. Then O © (Y, d) is open if and only if O is the union 


of sets of the form |[O;, where each O; is open in (Yj, d;). 


i=1 
PROBLEMS 


2.1 Show that0A = 0A° = AN A®. 
2.2 Is it true that A C B yields 

(a) ACB? 

(b) ACB? 
2.3. Show that [A‘]* C A. 
2.4 Prove that a closed ball C(x, 1) is a closed set. 
2.5 Ifa € OA, must x be an accumulation point? 
2.6 Show that A= AU A’. 


2.7 Let AC (X,d), where X is an infinite set. Show that if x is 
an accumulation point of A, then every open set containing x 
contains infinitely many points of A. 
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2.8 


2.9 


2.10 


2.11 


2.12 


2.13 


2.14 


2.15 
2.16 


2.17 


2.18 


2.19 
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Give an example of a continuum closed set that does not have 
any accumulation point. 


Describe the shape of open balls in the metric space (X, d) 
introduced in Example 1.3(22). 


Show that set [1, oo) is closed in the metric space of Example 
1.3); 


Prove that A = A + 0A. 


Show that AU B= AUB. 


Show that AN BC ANB. 


Show that AN BAAN B. 
Prove Corollary 2.14: A set A is closed if and only if A = A. 


Show that in any discrete metric space, the boundary of any 
subset is empty. 


Define S(z,r) = {y € X : d(x,y) =r} and call it a sphere 
centered at x with radius r(r > 0). In Euclidean space 
(R",d.), obviously, S(z,r) = OB(az,r). Is this true for metric 
spaces? If it is not, then is there at least a relationship between 
S(a,r) and OB(x,r) common for all metric spaces? Are there 
any exceptional cases? Explain. 


Show that set (0, 1] is closed in the metric space of Example 
1.3 (i). 


Let O be an open set in (R",d.). Show that for each x € O, 
there is an open ball B(q,7r) 5 x, such that g € Q” (the set of 
all vectors in R” with rational coordinates) and r € Q. 
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NEW TERMS: 


open ball B(x, r), 101 

radius of an open ball, 101 

supremum metric,101, 102 

open ball with respect to the Euclidean metric, 102 
open ball with respect to the supremum metric, 102, 103 
open (d-open) set, 103 

interior point, 105 

interior A or Int(A) of set A, 105 

closed set, 106 

closure point, 106 

closure A or C/(A) of set A, 106 

isolated closure point, 106 

accumulation point, 106 

derived set (A’), 107 

boundary point, 107 

boundary OA of set A, 107 

closed ball C(x, 1), 108 

sphere S(z,7r), 110 
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3. CONVERGENCE IN METRIC SPACES 


This section introduces the reader to one of the central notions in the 
analysis of metric spaces: convergence. Among different things, we 
discuss the relation between limit and closure points. 


3.1 Definitions. 


(i) A function [N, X, f] is called a sequence; its most com- 
monly used notation is {z,} =f, with x, = f(n). Let {x,} C 
(X,d) be a sequence and let ¢ > 0 and N be a positive integer. A 
subsequence Ty = {ty,2y41,--- } is called an N(e)-tail of {x,} if 
for any &m,Un € Ty, d(Lm, Xn) < €. The sequence {x,,} is called a 
Cauchy sequence, in notation 


litt :d( Gg; ty, = 0, 
m,n Co 


if for every « > 0, there is an N(e)-tail of {2}. 
(it) Let a € X. Any N(e)-tail, Ty, is called an N (2, €)-tail of 


{z,} if Ty C B(ax,e). The sequence {,,} is said to converge to a 
point x € X if for every e > O, there is an N(a, €)-tail. In notation, 


jim d(@n, x) = 0 


(also d-lim 2, = x or just x, — «). a is called a limit point of the 
sequence {x,,}. A sequence is convergent if it is convergent to at 
least one limit point that belongs to X. 


(zi7) A point x is said to be a sequential limit point of a set A if 
there is a sequence {x,,} C A convergent to x. 
(iv) A set A ina metric space (X,d) is called complete if every 


Cauchy sequence in A is convergent in A. Consequently, the metric 
space (X,d) is complete if so is X. 


(v) A sequence {x,} is called bounded if for every n, 
d(x1, 2») < M, where M is a positive real number. O 


3.2 Remark. A sequence in a metric space can have at most one 
limit point. Indeed, let x,y be limits of a sequence {,,} C (X,d) 


3. Convergence in Metric Spaces 113 


and let ¢ > 0 be arbitrary. Then, there are Ni (x, ¢/2) and No(y,€/2) 
tails such that 


(x,y) Sd, tn) + (ny) < § +56 


for all n > N = max{ Nj, No} [ie., d(x, y) can be made arbitrarily 
small]. Thus, 7 = y. O 


3.3 Theorem. Let A C (X,d). Then a point x is a closure point 
of a set A if and only if x is a sequential limit point of A (i.e., there 
is a sequence {x,,} C A such that %, > 2). 


Proof. 


(i) Let x be a closure point of A. If x € A then the proof becomes 
tiivial (if we set a, = 2, = 1,2,....), Leta e A\A. By the defini- 
tion of a closure point, every open ball B(z,r) meets A. Thus for 
every n, there is a point, x, € AM B(x, +), so that d(x,x,) < +. 
Therefore, {x,,} is a pledged sequence convergent to z. 

(ii) Let {a} CA such that lim, «2, =a. We prove that 
a € A. The convergence implies that for every ¢ > 0, there is a 
Tn = {xy, tn+1,---} such that Ty C B(x,e) and Ty C A. Thus, 
Ve >0, B(x,e) NA #@, which yields that x € A. 

Part (7) implies that if A is an infinite set and x € A’\A £@, 
ie., 2 is an accumulation point of A, then there is a sequence {z,,} 
with all distinct terms such that 7, — <x.) O 


3.4 Corollary. A subset A of a metric space (X,d) is closed if 
and only if it contains all of its sequential limit points. 


Proof. 


(i) Let A be closed and let {2,,} C A be a sequence convergent to 
a point x. Then, by Theorem 3.3, x is a sequential limit point and a 
closure point of A at the same time: 


lim ¢, =2€EA. 
nm CO 


114 CHAPTER 2. ANALYSIS OF METRIC SPACES 


Since A is closed, A = A and x € A. Thus, A contains all of its 
sequential limit points. 


(i) Suppose A contains all of its sequential limit points and let 
x € A. Then, by Theorem 3.3, there is a sequence {x,,} C A such 
that lim x, = x. By our assumption, x also belongs to A or, equi- 
valently, A C A implying that A = A and hence A is closed. O 

3.5 Definitions. 

(i) A subset A C (X,d) is called dense in X, if A= X. (By 
Theorem 3.3, A is dense in X if and only if the set of all limit points 


of A coincides with X, or, in other words, if and only if for every 
x € X, there exists a sequence {xz,,} C A such that x, — 2.) 


(it) Aset A C (X,d) is called nowhere dense if its closure has 
the empty set as its interior, that is, if Jnt(C/(A)) = ©. LO 


3.6 Examples. 


(2) Because each irrational number can be represented as the 
limit of a sequence of rational numbers, Q is dense in R (in terms of 
the Euclidean metric). 


(it) Let A = {1,5,10} C (R,d_.). Then A is nowhere dense. 


(iii) {4 : n =1,2,... } is nowhere dense in (R, d-). 


PROBLEMS 


3.1 Show that every convergent sequence is a Cauchy sequence. 
Give an example when the converse is not true. 


3.2 Prove that a set A C (X,d) is nowhere dense in X if and only 
if the complement of its closure is dense in X. 


3.3. Assuming that (IR, d.) is complete (a known fact from 
calculus), prove that (IR", d.) is also complete. 


3.4 Show that any Cauchy sequence is bounded. 
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3.5 Show that in a discrete metric space any convergent sequence 
has at most finitely many distinct terms. 

3.6 Show that any discrete metric space is complete. 


3.7 Show that if {x,,} C (X,d) is a Cauchy sequence and {z,,, } is 
a subsequence convergent to a point a € X, then, — a. 


3.8 Show that in (R",d.), B(x,r) = C(a,r), where C(z,r) is a 
closed ball of Remark 2.15. 
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sequence, 112 

N(a,¢)-tail, 112 

Cauchy sequence, 112 

convergent sequence, 112 

limit point of a sequence, 112 
sequential limit point of a set, 112 
complete metric space, 112 
bounded sequence, 112 

dense set, 114 

nowhere dense set, 114 
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4. CONTINUOUS MAPPINGS IN METRIC SPACES 


4.1 Definition. Let (X,d) and (Y,p) be two metric spaces. A 
function f : (X,d) — (Y,p) is called continuous at a point x) © X 
if for each e > 0, there is 6 > 0 such that p( f(x), f(xo)) < € for all 
x with d(x, x9) < 6. Function f is called continuous on X or simply 
continuous if f is continuous at every point of X. O 


4.2 Remark. (The anatomy of continuity.) Since for any func- 
tion, x € f* ({f(@)}), we always have x € f*(B,(f(20),€)) 
[the inverse image of an open ball in Y centered at f(2o)], the key 
question is whether xo is an interior point of f*(B,(f(xo),€)) for 
any € > 0. We show that f is continuous at xo if and only if the in- 


verse image under f* of any open ball centered at f(a) contains x 
as an interior point. (See Figure 4.1.) 


¥ 


F(%) 


| X, (interior point) 
a 


f' (Bf %)€) x 


Figure 4.1 
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If xp is an interior point of f*(B,(f(xo),€)), as conjectured, there 
must be an open ball Ba(xo, 6) C f*(Bp(f (xo), €)). It implies that: 
1) such a positive 6 exists, and 2) the image of By(xo, 6) under f, is 
a subset of B,(f(xo),€) (see arguments in Theorem 4.3 below), 
which guarantees that p(f(x),f(ao))<e for all x with 
AG, 25). <b: 

Now, if f is not continuous at xp, as depicted in Figure 4.2, 2 
need not be an interior point of f*(B,(f(xo), €)). If this is the case, 
no ball Ba(xo,6) can be inscribed in f*(B,(f(axo),€)) and 
consequently, no positive 6 exists to warrant that p( f(x), f(ao)) is 
less than ¢ for all x with d(x, x9) < 6. 


Y 


B,(L(%)8) 1 


| X, (non-interior point) 


Se 


f°(B, (Ff ),€) 


Figure 4.2 


On the other hand, the reader should not jump to a quick conclusion 
that Figure 4.2 (which serves for a mere illustration) necessarily re- 
presents a discontinuous function. For instance, if X is endowed 
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with the discrete metric, then x9 is an interior point of the set 
f*(Bo(f (xo), €)), because {xo} is an open ball with any radius 
r <1. Furthermore, f*(B,(f(o),€)) is an open set in X for all 
0, O 


The following theorem elaborates the above discussion and also 
generalizes the principle of continuity. 


4.3 Theorem. 4 function f :(X,d) — (Y,p) is continuous if 
and only if the inverse image of any open set in (Y,p) under f is 
open in (X,d). 

Proof. 


1) According to the blueprint in Remark 4.2, we begin the proof 
by showing the validity of the following assertion. 


f is continuous at x if and only if xo is an interior point of 
the inverse image under f* of any open ball B,(f (xo), €). 


Let xo be an interior point of f*(B,(f(xo), €)). Then, there is an open 
ball 
Ba(xo, 6) € f*(Bo( fo), ©); 


and because inclusions are preserved under f,, by Problems 3.6(a) 
and 2.6 of Chapter 1, 


f(Ba(ao, 6)) S f(F"(Bp(F(@0), €))) S Bo(F (#0), ©), 


which yields continuity of f at 79, as per Remark 4.2. 


Now, let f be continuous at x9. Then, the inclusion f,(Ba(xo, 6)) 
C B,(f(Zo), €) holds, which, along with Problem 2.5, Chapter 1, 
leads to the following sequence of inclusions. 


Ba(xo, 6) S f°(F(Balxo, 6))) S F°(Bp(F@o), €)). 


Because 2p is the center of By(xo, 6), it is an interior point of this 
ball and, due to the last inclusion, an interior point of the set 


f°(B(F (0); €)- 
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2) Suppose f is continuous on X. We show that for each open set 
OCY, f*(O) is open in (X,d). Pick a point x € f*(O). Then, 
f (xo) € fx(f*(O)) © O and, because O is open, f(xo) is its interior 
point. Thus, O is a superset of the open ball B,(f(xo),¢), for some 
€, and consequently, 


f°(BoF (#0), ©) S FO). (4.3) 


Because f is continuous at xg, by assertion 1), x must be an interior 
point of f*(B,(f (xo), €)), and, by (4.3), an interior point of f*(O). 
Thus, f*(O) is open. 

3) Let f*(O) be open in (X,d) for every open subset O of Y. 
Take xp) € X and construct an open ball 6,(f(xo),¢). By our 
assumption, the set f*(B,(f(%o),€)) is open in (X,d). Because 
f(xo) € B,(f (£0), €), we have that 


to € f*({f@o) S f(BF@o), €)) 


and, therefore, x) € f*(B,(f(Zo), €)) and it is an interior point of 
f*(B,(f(@o), €)). By 1), f must then be continuous at xp. O 


There is yet another useful criterion of continuity, known as 
Heine's criterion (after the German mathematician Heinrich Eduard 
Heine, 1821 — 1861). 

4.4 Theorem (H.E. Heine). 4 function f: (X,d) — (Y,p) is 
continuous at x € X if and only if for every sequence {x,,}, d-con- 
vergent to x, its image sequence { f(x,,)} is p-convergent to f(x). 


We prove a version of this theorem for a more general case in 
Chapter 3 (Theorems 4.9 and 4.10). 


4.5 Definition. Let (X,d) be a metric space and 7(d) be the 
collection of all open subsets of X with respect to metric d. Then 
T(d) (or just T) is said to be the topology on X generated byd. U 


Theorem 4.3 can now be rephrased as follows. 


4.6 Theorem. Let f: (X,d) — (Y, p) bea function and let r(d) 
and t(p) be the topologies generated by metrics d and p, 


4. Continuous Mappings in Metric Spaces 121 


respectively. Then f is continuous on X if and only if 
f"(r(p)) € r(d) fie, VO € r(p), f*(O) € r(4)]. O 


4.7 Example. Let f: (R,d) — (R,d.) be the Dirichlet function 
defined as f = 1g, where Q is the set of rational numbers. If d = d-, 
is the Euclidean metric then f is discontinuous at every point. If d is 
the discrete metric, by Theorem 4.3, f is continuous on R, because 
the inverse image of any open set in (IR,d.) under f is clearly an 
element of the power set coinciding with the discrete topology 
generated by the discrete metric (see Example 2.7). 0 


We are further interested in the conditions under which two dif- 
ferent metrics on X generate one and the same topology. This 
property of metrics satisfies an equivalence relation on the set of all 
topologies on X and hence is regarded as equivalence of metrics. 


4.8 Definition. Two metrics d; and dz on X are called equivalent 
if r(d,) = r(d2) (in notation d, & dy). O 

According to Definition 4.8, all metrics sharing one and the same 
topology 7 on X form one equivalence class, say [d]_. Therefore, if 
Dx is the set of all metrics on X, then Dx|E, is the quotient set of 
Dx modulo E,. 

4.9 Remark. Let (X,d,) and (X, dz) be two metric spaces and let 
f: (X,d4) — (X,db2) be the identity map (f(x) = x, x € X). If 
d, and dz are equivalent [i.e., (d,) = T(d2)], then for every open 
set O in (X,d2), f*(O) =O € r(d,). Thus f is continuous on X. 
According to Heine's Theorem 4.4, this is equivalent to the statement 
that 


lim .d;(%,,%) =0 
( oman®,2) 
implying that 


Jim ,,2( Fn), £(2)) = fim a(n, 27) = 0. 


In a nutshell, assuming that 


(i) r(di) = r(d2) 
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we showed that 


(it) jim ,di(@n, 2) =0s jim do(an, 2) = 0, 


By Heine's theorem, it follows that the converse is also true: state- 
ment (22) implies statement (7). Hence, we may call two metrics 
T(d,) and r(dz) on X equivalent if (7) or (47) holds. 

In short, two metrics d; and dz on X are equivalent if and only if 
any sequence {x,}C X convergent in d, is convergent in dz and 
vice versa. O 


From Theorem 4.3, it also follows that the identity map in Re- 
mark 4.9 is continuous “under equivalent metrics.” However, an 
identity map need not be continuous under d, and dz» if they are not 
equivalent. 


4.10 Definitions. 


(i) Let A be a subset in a metric space (X,d). The real number 
or infinity 
d(A) = sup{d(z,y): t,y € A} 


is called the diameter of A. The set A is called d-bounded or just 
bounded if d(A) < oo. Particularly, the metric space (X, d) (or just 
metric d) is called bounded if so is X. Set A is said to be unbounded 
if d(A) = co. 


(ii) A subset A in a metric space (X,d) is called totally 
bounded if for every « > 0, A can be covered by finitely many «- 
balls (i.e., balls with common radius ¢€). O 


4.11 Example. According to Problem 1.4, the function 


p(z,y) = il aia 


defined on a metric space (X, d) is a metric on X. Obviously 


lim (Zn, g)=0 


n> 
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if and only if lim, p(tn, x) = 0 (due to d = Ter ). Therefore, d 
and p are equivalent. Observe that p is clearly bounded whereas d is 
arbitrary. O 


We close this section with a short discussion on uniform conti- 
nuity. This concept is developed further in Section 6 and Chapter 3. 
Uniform continuity is an important notion in analysis. Real-valued 
uniformly continuous functions can be approximated by step 
functions and this is a core issue when proving the Riemann integra- 
bility of continuous functions. The generalization of uniform 
continuity for metric spaces is useful in more abstract settings and is 
relatively straightforward. 


4.12 Definition. A function f: (X,d) — (Y,p) is called 
uniformly continuous on X if for every ¢ > 0, there is a positive real 
number 6 such that d(x, y) < 6 implies that p( f(x), f(y)) < ¢, for 
every x,y € X. i 


In the case of usual continuity, a delta depends upon a particular 
point x € X, where the continuity holds, so that a common delta, 
good for all points x € X, need not exist. Unlike continuity, uniform 
continuity guarantees the existence of such positive 6 (for every 
fixed ¢) and for all points of X simultaneously. Clearly, uniform 
continuity implies continuity. 


Uniform continuity can also be defined on some subset A of X, 
so that in Definition 4.12, X will be replaced with A. 


4.13 Examples. 
(i) Consider f: (R, d.) — (R,d_) such that f(a) = x”. Then 


|zo —2| <6 
implies that 


|x + xo| = |x — xo + 225 
< |x — xq| + 2|x9| < 6 + 2| x0] 


and 
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|f(x) — f(xo)| = |z* — 26] = |x — ao] - |x + 20] 
< 6+ (6 +2|z9]). 


Take 6-(6+2|20|) as ¢. Then 6 can be found explicitly as a 
function of € such that 


6= rete — |xol. 


Therefore, function x ++ x? is d--continuous at every point 29 € R. 
However, x” is not uniformly continuous on R, because 6 depends 
upon 29 as well. Specifically, 6 — 0 when xv — oo. Consequently, 
we cannot find a 6 > 0 good for all xo. 


(ii) Let f(x) = x? be given as 
f: ({0,3],de) — ({0, 9], de). 
From the last inequality above we derive 
| f(x) — F(%0) | < 66 + 2|zo]) < 6(6 + 6). 


The latter inequality implies that 6 = ./9+ e-—3, where ¢= 
6(6+6). Thus d.(f(x), f(xo)) <é€ whenever d.(x,20) << 6= 
\/9 + € — 3. Because 6 is independent of xo, f(x) is uniformly con- 
tinuous. Observe that f has been given on a closed and bounded 
interval which provides the uniform continuity. However, in this 
case, f would also be uniformly continuous even if f were defined 
on any bounded but not necessarily closed interval, for instance 
(0,3) (why?). 

(iit) A continuous function can be uniformly continuous over 
unbounded sets as, for example, functions f(x) = +, « € [1,0o), 


and f(x) =sinz, ER. 


(iv) If (X,d) and (Y,,p) are two metric spaces, an isometry 
from X to Y is any bijective map [X, Y, f] such that 


d(x1,2%2) = p(f(21), f(x2)), forall 71,22 € X. 
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Obviously, such an f is uniformly continuous on X. 


(v) A function f from (X,d) to (Y,,/) is said to satisfy the 
Lipschitz condition on X if there is a nonnegative constant C’ such 
that for each x,y € X, p( f(x), f(y)) < Cd(a, y). The smallest such 
C is called the Lipschitz constant of f. The function f is uniformly 
continuous on X if f satisfies the Lipschitz condition. (See Problem 
4.17.) O 


There is an analytical result, known as the Heine-Borel theorem, 
stating that any continuous function defined on a closed and bounded 
set in any Euclidean metric space is also uniformly continuous. The 
general form of this result will be discussed in Section 6 (Theorem 
6.13). 


4.14 Definition. A function f from (X,d) to (Y, p) is said to be 
Cauchy continuous if the image under f, of every d-Cauchy 
sequence {2} [i.e., fx({2,})] is a p-Cauchy sequence. O 


4.15 Example. Notice that continuous functions do not map 
Cauchy sequences to Cauchy sequences, that is, continuous functions 
need not be Cauchy continuous. Let us introduce the function 
[(— 5,5), R, rtan], where rtan: = Res(_z z)tan, and set arctan: = 


rtan~!. Consider the sequence 


G2 Sancta 2), we 1 2s 5 


Then, although {z,,} is a Cauchy sequence in (R,d.), {rtan(zx,,)} is 
not. (See Problem 4.14.) O 


Notice that continuous functions do not map Cauchy sequences to 
Cauchy sequences as we learned it from Example 4.15. However, 
uniformly continuous functions do it. 


The statement below is an easy exercise. (See Problem 4.15.) 


4.16 Proposition. Jf f from (X,d)to(Y,p)is uniformly 
continuous, then it is also Cauchy continuous. O 


4.17 Remark. Suppose f : (X,d) — (Y,,) is a map. What can 
go wrong with f if it is not uniformly continuous? Suppose f is 
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continuous, but not uniformly continuous. Thus, given an « > 0 and 
an 2, there is a 6,: =6(x1,€) such that f,(Ba(x1,6,)) C 
B,(f(#1),€). Using the same ¢, we can find for another point x2 a 
69: = 6(%2,€) such that f,(Ba(x2, 62)) C B,(f(a2),e). If we con- 
tinue with this process point after point, it may happen that along 
some sequential path {x,,}, it is getting increasingly harder to 
“squeeze” f,(Ba(2n,6n)) into B,(f(an),€) while 6, becomes 
smaller and smaller, as we can see in Figure 4.3 below. 


f.(B,(%,5,)) 


B,(x,,8,) 


C) BT ( 


(x8 


B,(F(%).€) 
e 


X,),£) JIB ASO) 
Bi(83) Bf (a%),£) FAB, (%,55,)) 
: «(07 %,03 
° O 


Figure 4.3 


B, 
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So, along the path {z,,}, 6, | 0. The sequence {x,,} was brought for 
an illustration. In general, if f is not uniformly continuous and X is 
an infinite set, then there is no positive 6(€) good for all x € X such 
that f..(Ba(x,6(e))) C B,(f(x),e). However, if such a positive 
6(€) does exist, f is uniformly continuous. 


There is another way to say this. Suppose f is continuous on X 
and let « > 0 be chosen. Then for each x € X, there is a 6, = 6,(€) 
> O such that 7,(.B3(a,0,)) © By (a),e). Let At = {és(e) ta 
€ X}. If f is not uniformly continuous, then for at least one €o, 
infA = 0 and thus it holds for all e < & . i 


4.18 Remark. It is known from calculus that the space of all real- 
valued continuous functions defined on R” is closed under the for- 
mation of main algebraic operations. What if the functions were 
defined on an arbitrary space (X,d)? We give here some informal 
discussion on this matter. Let R* be the space of all real-valued 
functions defined on a set X and let f,g € R*. Define the follow- 
ing. 

(i) f+g is the function such that for each point x € X, 
(f+ g)(a) = f(x) + g(a). 

(it) fg is the function such that for each x € X, (fg)(x) = 
f(x) + g(a). 

(iit) +00 and —oo are not real numbers. Consequently, f/g is 
the function such that for all « € X, (f/g)(x) = f(x)/g(x), exclud- 


ing x € X for which g(x) = 0. At all those values, the function f/g 
is either undefined or should be specified. 


(iv) As a special case, any real-valued function multiplied by a 
real number is a real-valued function too. 


(v) The associative (relative to multiplication) and distributive 
laws of functions relative to the addition and multiplication defined 
in (2) and (iz) are the corresponding consequences of these laws for 
real numbers. 


Bearing in mind these observations, we conclude that the space 
R* with the above operations is a commutative algebra over the field 
R with unity, and a vector lattice (which was also mentioned in 
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Example 7.7(ix), Chapter 1). A subset C((X, d); (IR, p)) (of R*) of 
all continuous functions is a subalgebra characterized by the follow- 
ing properties. 


(a) f,geEC > aft+bgEC, Va,bER. 
(b) f,gEC > fg Ee. O 


PROBLEMS 


4.1 Show that if A is totally bounded then A is bounded. Give an 
example, where a bounded set is not totally bounded. 


4.2 Prove that C in Remark 4.14 is indeed a subalgebra with 
properties (a) and (0). 


4.3. Show that a continuous bounded function on a bounded inter- 
val need not be uniformly continuous. 


In Problems 4.4-4.8 it is assumed that f and g are functions 
from (R, d.) to (R, d.). 


4.4 Let f: ((—co,0),d.) + ((—co,0),d.) be a function given 
by f(x) = +. Show that f is continuous. Explain why f(z) is 


not uniformly continuous. 


45 Let f: A — R be a differentiable function such that its 
derivative f’ is bounded over A, where A is an arbitrary 
(bounded or unbounded) interval. Show that f is uniformly 
continuous on A. 


4.6 Show that if f and g are uniformly continuous on R and 
bounded then fg is uniformly continuous on R too. However, 
if f and g are uniformly continuous then fg need not be uni- 
formly continuous. 


4.7 Which of the following functions are uniformly continuous? 
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4.8 


4.9 


4.10 


4.11 


4.12 


4.13 


a) f(x) =sin’x (x € R). 

b) f(x) = xcosx (x € R). 

é) f(z) =esmnae (& € R) 

ad) f(a) =Ing (@ € |1,00), 

e) f(x) = x*Inz (x € (1, 100)). 
f\ f(a) =/f2,2€ Ry. 

9) fiz) = sina’. 


Let f be a continuous function and g a uniformly continuous 
function on a set A such that |f| < |g|. Is f then uniformly 
continuous? 


Show that in (R”, d.), any bounded set is also totally bounded. 


Show that in R”, Euclidean and supremum metrics are equi- 
valent. 


Let A be a nonempty subset in a metric space (X, d). Define 
the function f(x) = d(A,x) = inf{d(x,y):y € A} as the 
distance from x € X to set A. Show that f is uniformly 
continuous. 


Let (X,d) be a metric space and define the function 
p: (X?, X?) — Ry as p(X, y) = d(x1, ¥1) + d(x2, ya), where 
X = (%1, 22) andy = (yi, y2). (a) Show that p is a metric on 
X?. (b) Define the function f(x) = d(x,,22). Show that 
f : X? =R, is uniformly continuous on (X?, p). 


Let (X,d) and (Y,p) be two metric spaces. Let function 
[X, Y, f] be a map. Is the following argument true? “If f is not 
uniformly continuous, then there is an ¢ > O such that for each 
6 > 0, there is a set Ss C X with d : Ss x Ss — (0,6), and 


p: fx(Ss) x fx(S5) — [e, 00). 


If Ss is finite, then we choose 6 = min{d(x, y) : 2, y € Ss}. It 
shows that 6 is reducible to 6. Herewith we empty Ss and thus 
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4.15 


4.16 


4.17 


4.18 
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for all «,y € X,d(x,y) <6 and p(f(x), f(y)) < e. Thus Ss 
is infinite.” 


Give an example of a continuous function [X,Y, f] from 
(X,d) to (Y,p) such that f is not Cauchy continuous. 


Prove Proposition 4.16. Let (X,d) and (Y,p) be two metric 
spaces and let function [X,Y , f| be uniformly continuous on 
X. Then f is also Cauchy continuous. 


Prove the following statement. Let (X,d) and (Y,p) be two 
metric spaces and let |X,Y,f| and |X,Y,g] be two 
continuous mappings. The subset A= {x € X: f(x) = g(ax)} 
is closed. 


Recall [Example 4.13 (v)] that a function f from (X,d) to 
(Y, p) is said to satisfy the Lipschitz condition on X if there is 
a nonnegative constant C’ such that for each x,y € X, 
p( f(x), f(y)) < Cd(a, y). The smallest such C is called the 
Lipschitz constant of f. Show that f is uniformly continuous 
on X if f satisfies the Lipschitz condition. 


Let (X,d) be a metric space such that d is bounded, that 
means d(X) < oo. Is the function p defined on P(X) x P(X) 
as 


p(A, B) = sup{d(z,y): 2 € A,y € B} 


a metric on P(X)? If it is not, show which properties of a 
metric hold and which do not. 
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5. COMPLETE METRIC SPACES 


In this section we are concerned with completeness of metric spaces 
previously introduced in Definition 3.1(iv). According to Problems 
5.10 and 5.11, completeness is a stronger property of a subset than 
closeness. However, these two notions of a set coincide if the metric 
space is complete, as we learn it from the following theorem. 


5.1 Theorem. Let (X,d) be a complete metric space. Then, a 
subspace (A, d) is complete if and only if A is closed. 

Proof. Let A be closed and let {x,} CA be any Cauchy 
sequence. Because (X,d) is complete, there is a point x € X such 
that lim, _..6 Z, = x. Then, by Corollary 3.4, 2 € A. Thus, (A, d) is 
complete. Now, let (A, d) be complete and {z,,} be any convergent 
sequence in A. Then this sequence is also a Cauchy sequence and 
hence A contains its limit. Therefore, A is closed, again, by Corol- 
lary 3.4. O 


The reader should be aware of the differences between the notions 
of completeness and closeness of a subspace. (See Problem 5.4.) 


5.2 Theorem. 4 metric space (X,d) is complete if and only if 
every nested sequence {C(%n,1n)} of closed balls, with r;,| 0 as 
n — co, has a nonempty intersection. 


Proof. Because r,, | 0, for any « > 0, there is an integer v such 
that r, < se. Given thatk > n> vy, 


C(@is rk) Cc Ce Tr) s Cis Ta) 
and, consequently, 
dita) 2p se 


Therefore, {x,,} is a Cauchy sequence. 


Now, assume that (X,d) is complete. Then {xz,,} converges to a 
point, say x € X. In as muchas each ball C(x», r,,) contains the tail 


tg ndbee ssf 
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of the sequence {x,,} and because it is closed, it must contain «x. 
(oe) 

Thus, () C(a,,, rn) contains x and hence it is not empty. 
n=1 


Now, let any nested sequence of closed balls have a nonempty 
intersection and let {x;} be a Cauchy sequence in X. By Definition 
3.1 (222), it implies the existence of an increasing subsequence 
{v|, V2,... } of indices of {x;,} such that for each n, 


1 
(Ls, Ly,) < gat, for s > vp. 


We show that the sequence {Cn =C Cae 2)} is nested. Indeed, 
let y € C41. Then 


al 1 
dy, Ly) < gnt+1 and diy, 5 Dyis) < gnt+i+ 


Therefore, 
dy, i) < x, 


which yields that y is an interior point of C,, and thus C,, D C14. 


CO 
Because by our assumption, the intersection () C,, ~ @, there is at 


n=1 
least one point, say x that belongs to all balls. Furthermore, because 
the sequence {r,,} of their radii is convergent to zero, the 
subsequence {x,, } of their centers must converge to x € X and thus, 
by Problem 3.9, {;,} also converges to x. O 
5.3 Remark. In the final phrase of the last theorem, x is a unique 
love) 
point of the intersection (| C,,. Theorem 5.4 is a useful refinement of 
n=1 
this statement due to Georg Cantor. Because of its similarity to The- 
orem 5.2, its proof is suggested as an exercise (Problem 5.4). O 


5.4 Theorem (Cantor). Let (X,d) be a complete metric space 
and let {A,}| CX be a sequence of nonempty closed subsets with 


lim; 00 (An) = 0. 
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Then (| A, consists of exactly one element. O 
n=1 

5.5 Definition. A function LX, (Y, p), f] is called p-bounded if Y 

is a vector space and there is a nonnegative real number V/ such that 

p(f (x), O(x)) < M, Vax € X, where 0 is the function identically 

equal to @ € Y (the origin of Y). O 


5.6 Examples. 


(i) Let X be a nonempty set, (Y, p) a metric vector space, and let 
F,, = F(X; (Y, p)) be the set of all p-bounded functions from X to 
Y. For all f,g € F, define 


doo(f, 9) = sup{o( f(x), g(x)): x © X}. 


It can be shown (Problem 5.5) that d,, is a metric on F,, called a 
uniform (or supremum) metric. Consequently, the convergence in 
(F., ds.) 1s called the uniform convergence. A subset of functions 
F C F,, is said to be uniformly bounded on X if F is dx-bounded; 
that is, diam F = sup{d(f,g):f,g€ F} <M (a positive real 
number). 


We show that any Cauchy sequence in (F,,) is uniformly 
bounded. We will make use of Problem 5.5. Let {f;,} be a Cauchy 
sequence in (F,,d..). Therefore, for « = 1, there is an N = N(1) 
such that dsc(fny je) < 1; mk SN. Let kh = (1), Then, 


se Tia UW) at esl Jaa d a dx.(fr,0) i | + M(fn), 


where M( fy) is a “p-bound” of function fy. If M(f;) is a bound of 
f;, then M, defined as 


M = max{M(f_),...,M(fw-i),1+ M(fw)}, 


dx.-dominates the whole sequence {f,,}. By Problem 5.5, we have 
that { f,,} is d..-bounded. 


(ii) Assume that (Y,p) is a complete metric vector space. Let 
us show then that (F,,d,.) is also complete. Consider a Cauchy 
sequence { f;,} C (Fx, d..). It is obvious that for each fixed x € X, 
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the sequence { f,,(x)} is also Cauchy in (Y, ~). Because (Y, p) is by 
our assumption complete, the “pointwise limit” of {f,,} exists. 
Denote it by f. In other words, 


Jim, pUful), f(2)) =0, Ve € X. 
We need to show that f € (F,,d.). Because {f,,} is a Cauchy 


sequence, according to (7) it is uniformly bounded by a real number 
M. Thus we have 


p( f(x), 0(x)) < e(f(@), fn(x)) + p(fn(x), O(z)) 


< p(f (2), falZ)) + doo(fn,0) <M, 
that is, 
p(f(@), 0(@)) < p( f(a), fn(a)) + M. 
The last inequality holds for every x € X ifn — oo, which yields 


p( f(x), 0(2)) < M, forall x € X. 


Consequently, dx.(f,0) < M and hence, f € (Fx, do). 
We only showed that 
fr(x) 4, F(x), 


for each x € X, and that f € F,. The assertion f,, = f is subject to 
Problem 5.7. O 


PROBLEMS 


5.1 Using similar arguments as in Example 5.6, show that the limit 
of any uniformly convergent sequence of continuous bounded 
functions from (X, dp) to (Y,) is a bounded and continuous 
function. 
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Prove Cantor's Theorem 5.4. 


Let {C;,} be a sequence of closed balls in (R",d.) such that 
each of the balls C’, is centered at a point zg € IR” and has 


radius 4 , n= 1,2,... . Find the intersection {) C,,. 

n=1 
Show that if a metric space (X,d) is not complete then a 
closed subspace (A,d) need not be complete either. (Hint: 
Consider the metric space in Problems 2.9 and 2.10.) 


Show that d,,, defined in Example 5.6(2), is a metric on F,. 


Let F C F,(X;(Y,/)), where Y is a vector space. Prove that 
F is dx-bounded if and only if there is a positive constant 
such that for all f € F, d(f,0) < M. 


Show that in Example 5.6 (77) f, dog e 


We can make use of the fact that the Euclidean and uniform 
(supremum) metrics are equivalent (see Problem 4.10) to show 
completeness of (IR",d,.). For n = 1, it is well known from 
calculus. Prove completeness of (IR",d_) for an arbitrary n. 
(See Problem 3.5.) 


Let (X, d) be a metric space. A subset A C X is said to be of 
the first category if it can be represented as a countable union 
of nowhere dense sets. Otherwise, A is of the second category. 
Prove Baire's category theorem: a complete metric space is of 
the second category. 


Let (X,d) be a metric space and A C X be a complete subset. 
Show that A is closed. 


Show that a closed set in a metric space need not be complete. 
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6. COMPACTNESS 


Compactness is one of the key concepts in real analysis. We develop 
it in the present section for metric spaces and then revisit it in Chap- 
ter 3 for general topological spaces. It stems from the fact known in 
R that every bounded sequence has a convergent subsequence, which 
implies that any sequence in a closed bounded interval has a sub- 
sequence convergent to a point in this interval. In a general metric 
space, a subset A, in which every sequence has a subsequence con- 
vergent to a point in A, is called sequentially compact or just com- 
pact. Although compactness and sequential compactness are 
generally distinct notions in topological spaces (and they are defined 
in a different way), they are equivalent in metric spaces as Theorem 
6.3 states. 


Continuous functions defined on compact sets are uniformly con- 
tinuous; continuous images of compact sets are compact (hence, 
closed and bounded) and this means that in normed vector spaces 
(introduced in the next section), continuous functions on compact 
sets reach their maximum values. Further applications lead to the 
celebrated Ascoli and Ascoli - Arzela theorems [Dsh2]. 


6.1 Definitions. 


(i) A family of sets {A;: 7 € I} C (X,d) is called a cover of a 
set A C X if 
ACU Aj. 
tel 
Any subfamily of {A; : 7 € I}, that covers A is called a subcover of 
A. If {A; : 1 € I} is a family of open sets, then the corresponding 
cover (or subcover) is called an open cover (or an open subcover). 


(it) Aset A C (X,d) is called compact if any open cover of A 
has within itself a finite subcover of A, or we also say that “any open 
cover of A can be reduced to a finite subcover of A.” Correspond- 
ingly, (X,d) is a compact metric space if so is X. Notice that any 
finite subset is compact. Consequently, to avoid triviality, in all 
theorems below we assume that underlying sets of spaces are 
infinite. 
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(iit) A set A C (X,d) is called a Lindelfiset if any open cover 
of A contains a countable subcover of A (or can be reduced to a 
countable subcover). (X,d) is called a Lindelfispace if X is a 
Lindel6f set. A Lindeléf space was named after the Finnish 
topologist Ernst Leonard Lindeldf (1870-1946). O 


A noteworthy property of Euclidean spaces is given in the follow- 
ing classical result. 


6.2 Theorem (Lindeléf). Any subset of R” is Lindelflset in 
R",d.). In particular, (IR",d_-) is a Lindelf space. O 


(See Problem 6.7.) 


6.3 Theorem. For a subset A C (X,d), the following state- 
ments are equivalent. 


a 


(i) Ais compact. 


(ii) Every infinite subset of A has an accumulation point in A 
(in this case A is called Bolzano-Weierstrass compact). 


(iti) Every sequence in A has a subsequence that converges in A 
(A is called sequentially compact). O 


Sequential compactness of a subspace implies its completeness. 
(See Problem 6.6.) 


The proofs of the above statements are left for the reader (Prob- 
lem 6.8). 


Definition 6.4. A metric space is called separable if it has a dense 
countable subset. O 


Example 6.5. The Euclidean metric space (R, d.) is separable. A 
relevant dense countable subset of I would be Q, the set of rational 
numbers. Another example is the n-dimensional Euclidean metric 
space with the countable, dense subset Q”. O 


Theorem 6.6. Any compact metric space is separable. 


Proof. Let X be compact. It is easy to see that for each n € N, X 
can be covered by the family of open balls centered at every x € X 
with radius 4, Because (X,d) is compact, this open cover can be re- 
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duced to a finite subcover, such that [J B(x, +) contains X, where 
reF, 


By pace, ae Denote F’ = [J F,,, which obviously is a count- 
n=1 
able subset of X. We show that F is dense in X, that is, F = X. It is 
sufficient to prove that, for each y € X and r > 0, the open ball 
B(y,r) contains at least one point of the set F’, that is, y is a closure 
point of F’. Taking some y and r we choose any n such that t < (; 
Then if 
ye XC YU B(z,4), 
reF, 
there is a point x} ¢€ F,, such that y ¢ B (x7, r). This implies that 
d(x'' .y) <r and, therefore, zi) € B(y,r). Consequently, B(y,r) 
NF, £@ and Biy,r) NF #@. The proof of the statement is 
complete. O 


The following two theorems belong to central results in analysis. 


Theorem 6.7. Let A C (X,d) be compact. Then A is closed and 
bounded. 


Proof. 


(i) We show that A is bounded. Obviously A is covered by the 
family of open balls { B(x, 1): x € A}. Since A is ia this cov- 


er can be reduced to a finite subcover, that is, A C U B(ax,1) for 
k=1 


some integer h. Let M = max{d(z;,2;):i,7 =1,...,h}. Then M 
is finite. For any x, y € A, there are x; and x; such that « € B(a;, 1) 
and y € B(x;, 1). The following holds due to the triangle inequality. 


Gy) ade. ead eee) (ayy) <<, 


Therefore, A is bounded. 


(ii) We show that A is closed, that is, that A = A. Let x € A. By 
Theorem 3.3, there exists a sequence {x,,} C A such that rz, > 2. 
By Theorem 6.3, if A is compact, every sequence {x,,} C A has a 
subsequence that converges in A. By Problem 3.7, such a subse- 
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quence must have the same limit as {x,,}, that is, « € A. Therefore, 
A is closed. O 


The history of what today is called the Heine| Borel theorem (one 
of the most cited theorems in analysis) begins in the second half of 
nineteenth century in the work of the German mathematician Peter 
Gustav Dirichlet. The key issue was the concept of uniform continui- 
ty and the theorem stating that every continuous function on a closed 
interval is uniformly continuous. In his proof, Dirichlet implicitly 
used the existence of a finite subcover of a given open cover of a 
closed interval. He presented this proof in his lectures taught in 
1862, which were published only in 1904. The first known proof of 
the theorem was rendered in 1872 by Emile Borel and not by 
Heinrich Eduard Heine (who, by the way, was a student of Dirichlet). 
However, Borel's formulation was restricted to countable covers and 
he used techniques similar to those that Heine used to prove that con- 
tinuous functions on closed intervals are uniformly continuous. Con- 
sequently, Heine's name was attached due to the similarity in Heine's 
and Borel's approaches. Henri Lebesgue in 1898, among a few 
others, generalized the Heine - Borel theorem to arbitrary covers. 


Theorem 6.8 (Heine- Borel). 4 set A C (R",d.) is compact if 
and only if A is closed and bounded. 


Proof. 


(i) If A is compact, it is closed and bounded as a special case of 
Theorem 6.7. 


(iz) If A C (R”,d,) is closed and bounded, d,. (x,y) < M < «~, 
Va,y € A. Let y and a = (a1,...,@,,) be two elements of A. Then 


we have: 
ja] = faz < 4/22 (ax — 0)? = de(a, 8) 
k=1 


— d.(a, y) + de(y, 0) < M+ 4d.(y, 9), ,= i ere 2 


where @ denotes the origin of IR”. Because each y has a finite 
distance to the origin, every other point of A, like a, has also a finite 
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distance to the origin bounded by M+ d_(y,0). Note that even 
though d.(a, 0) < oo, d-(a,@) in general, need not have a uniform 
bound, unless A were bounded, which is assumed. Now we show 
that any d.-bounded sequence in R” has a convergent subsequence. 


The steps below represent an appropriate selection procedure. Let 
{x} C A. Then {2',} C R is a bounded sequence of i-coordinates 
(the ith-component sequence), 7 = 1,...,n. A bounded sequence 
need not to converge but does have a acme subsequence. 


. } with the limit 


point x). Select from the second-component sequence, the subse- 
quence with the same indices {x?,,2?,,...}. This subsequence is 
also bounded and hence contains a convergent subsequence, say 
{xz,,x7,,...}, with a limit point x), so that the set of indices {ky, 
ko,...} C {ri,ro,...}. If we return to the enon quence cetera 


.. } and select single out the subsequence {aj,,2;,, -.. }, then this 


For i = 1, let such a subsequence be {z! 


ry? ae 


subsequence is also convergent and has the same limit 2"), 


We can continue this process by ae the third-component 
sequence, selecting the subsequence {xj ee .} and from this 


sequence a convergent subsequence with a limit point x). Then the 
above first- and second-component subsequences will be reduced to 
the those with the indices from the third selection and so on. Let 
a = (2,..., 2) be the limit of the selected subsequence of {2; }. 
In as much as A is closed, x must belong to A. 


Therefore, we proved that an arbitrary sequence in A has a con- 
vergent subsequence in A, implying that A is sequentially compact. 
By Theorem 6.3, A is then compact. O 


6.9 Remark. The second part of the Heine-Borel theorem does 
not hold for general metric spaces. That is, if A is closed and bound- 
ed, it need not be compact. For example, let _X be an infinite set and 
let d be the discrete metric (which is finite) on X. Then X is closed 
and bounded. 


Now consider 


X CU B(z, 1). 
rEX 
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Because each of the balls covers just one point, the open cover 
{B(a,1): « € X} cannot be reduced to a finite subcover. Therefore, 
X is not compact. CL 


The following theorem is very important in many forthcoming 
applications. 


6.10 Theorem. Let f : (X,d) — (Y,p) be a continuous function 
and let K be a compact subset of X. Then the image f,.() is com- 
pact. In short, the image of a compact set under a continuous func- 
tion is compact. 


Proof. Take any open cover {O;: 7 € I} of f,(K) to have f,(K) 
C U O; . Then, 


wel 


KC f*(fe(K)) SUF (O}). 
iel 
Because f is continuous, f*(O/) is open, and because K’ is compact, 
there is a finite subcover of K by sets f*(O}), without loss of gene- 
rality indexed by 1, ...,, that is, 


KC UF (OY. 
k=1 


Because by Problem 3.6(a, b), Chapter 1, maps preserve inclusions 
and unions, we have 


The last inclusion is due to Problem 2.6, Chapter 1. O 


6.11 Remark. Let f: (X,d) — (R,d_) be a continuous map and 
let A C (X,d) be compact. Then, by Theorem 6.10, f(A) is com- 
pact in (IR, d.). By Theorem 6.7, f(A) is then closed and bounded, 
which means that the diameter of f(A) equals some M < oo. As 
mentioned in part (i7) in the proof of the Heine-Borel Theorem, this 
implies that all points of f(A) have a finite distance (i.e., are bound- 
ed by some Mo) to the origin, or equivalently, | f(a)| < Mo, for all 
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x € A. We have therefore shown that a continuous real-valued map 
on a compact set assumes a minimum and a maximum value. O 


6.12 Examples. 


(i) In (R,d-), R is closed but not bounded. Therefore, by the 
Heine-Borel theorem, R is not compact. 


(it) Take as A C (R,d_) the set (0, 1] which is bounded but not 
closed and therefore is not compact. We use a different argument to 
prove noncompactness of (0, 1]. Consider the open cover of A given 
by the family of sets {(4, 2): n = 1,2,... }. Obviously, 


lo) 

U (4,2) =(0,2)2 4. 
n=1 

It is not possible to select any finite subcover of A, for no finite 

subcover would include the point 0. Yet another argument that A is 

not compact (due to Theorem 6.3(ii)) is that the sequence {+} does 


not converge in A. O 


A continuous function need not be uniformly continuous, unless it 
is defined on a compact set, which is a widely referred to result 
known for Euclidean spaces. 


6.13 Theorem. Let f: (X,d) — (Y,p) be a continuous function 
and let (X,d) be compact. Then f is uniformly continuous on X. 


Proof. Let f be continuous at x. Then, for each ¢ > 0, there is a 
6, > 0, such that 


AF), FW) <5 


for all y with d(x,y) < 6,. Because X is compact, after reduction, 
there is an n-tuple of open balls such that 


i=1 
Let 6 = }min{6;,,...,5,,} and let x,y be such that d(x,y) < 6. 
Then x € B(x;, bx, /2) implies that 


Gle,24) < 5 On, 


6. Compactness 145 


and 
a(x, y) = d(x, y) ar az, a;) < 6 + 5 bx, = Ces 


Thus, y belongs to the ball B(x;, 6,,). Because y and x; are within 
the distance of 6,.,, due to continuity of f at z;, givene, 


AF (xi), FY) < 5. 


Obviously, d(x, x;) < 6,, yields p(f(x;), f(x)) < 5 and, therefore, 


AF@), FY) < PF), F@) + pF); FY) < €. O 


Here is a very useful criterion of compactness. 


6.14 Theorem. 4 metric space (X,d) is compact if and only if it 
is complete and totally bounded. 


Proof. 


(i) Let (X,d) be compact. Then by Problem 6.6, it is complete. 


Because X C |) B(x, €) for some ¢ > 0, by compactness, the cover 
cEX 

{ B(x, €): « € X} can be reduced to a finite subcover, which implies 

total boundedness. 


(iz) Let (X,d) be complete and totally bounded. We show that 
(X, d) is sequentially compact, which, by Theorem 6.3, would imply 
compactness. Let {,,} be a sequence in X. We construct a Cauchy 
subsequence. 


X is totally bounded. Therefore, it can be covered by finitely 
many open balls of radius 1. Then, at least one of the balls, for 
instance Bj, contains infinitely many terms, say {x;}, of this se- 
quence. Furthermore, we cover X by balls of radius 5 and again an 
infinite subsequence {x7} C {x;,} (because B, will also be covered) 
is contained in one of the balls, which we label By, and so on. 


The desired Cauchy sequence is formed by the selection of the 
first term from each subsequence. Indeed, by the construction, x} and 
x? belong to ball B;. Thus, d(x}, x7) < 1. Then, x? and x7 belong to 
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ball Bj, which implies that d(a7,2?) < 4, and so on. Because 


a? 


(X,d) is assumed to be complete, this Cauchy sequence must be 
convergent, thereby yielding sequential compactness of (X,d). UO 


PROBLEMS 


6.1 


6.2 


6.3 


6.4 


6.5 


6.6 


6.7 


6.8 


Show that if {a,} C (R",d.) with d(x,,0) <3, then {x;,} 
has a convergent subsequence. 


Define for each pair A, B € (X,d),d(A, B) = inf{d(a, b) : 
a € A,b € B}. Let A be compact. Show that VB C X, J an 
x € Asuch that d(x, B) = d(A, B). 


Let A, B C (X,d) such that A is compact and B is closed. If 
AN B=@Q, show that d(A, B) > 0. 


Let A C (X,d). Show that if A is totally bounded then A is 
also totally bounded. 


Generalize Theorem 6.6: Any Lindelfmetric space is separ- 
able. 


Show that sequential compactness of a subspace implies its 
completeness. 


Prove Theorem 6.2. Any subset of R” is Lindelf\ set in 
(IR", d.). In particular, (IR",d.) is a Lindelf space. 


Prove Theorem 6.3: For a subset A C (X,d), the following 

statements are equivalent. 

(i) Ais compact. 

(ii) Every infinite subset of A has an accumulation point in 
A (in this case A is called Bolzano-Weierstrass com- 
pact). 

(iit) Every sequence in A has a subsequence that converges 
in A (A is called sequentially compact). 
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6.9 Show that a mapping f : (X,d) — (Y, p) is continuous if and 
only if, for any compact subset Kt of X, Resx f is continuous. 


6.10 Show that a closed subset of a compact metric space is com- 
pact. 


6.11 Let K be the family of all compact subsets of a metric space 
(X,d). List those set operations which are contained within K. 
Give counterexamples for all set operations, which are not 
valid in K. 
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7. NORMED VECTOR SPACES 


We have already mentioned that the Euclidean metric defines the 
length of a vector in n-dimensional Euclidean vector space. The fol- 
lowing generalizes the notion of vector length in a vector space and 
reconciles it with the notion of a special metric defined on a vector 
space (initially discussed in Section 5). 


It is hard to say when exactly normed vector spaces were born. In 
1912, the Austrian mathematician Eduard Helly published an impor- 
tant paper about bounded functionals on the Banach space C;, yj fol- 
lowed by his habilitation work (submitted in 1918) and his seminal 
paper of 1921, “Uber Systeme linearer Gleichungen mit unendlich 
vielen Unbekannten”, Monatshefte friMathematik und Physik , 31, 
60-91. Here he considers subspaces of C' and defines a norm on 
one such subspace X, although he neither calls it the “norm” nor 
does he use the common notation || - ||. However, he uses instead the 
word Abstandfunktion (distance function) D: X — R, such that, 
with x € X, also axe X (a € F), with D(az) = |a|D(x), with 
z,yEexX, also r+yEX and Dix+y) < D(x) +D(y), and 
finally, D(x) = 0 implies that « = @. Although norms in various 
forms had been introduced earlier, Helly was the first one to utilize 
the relationship between the norm and convexity. Shortly thereafter, 
in 1922, Stefan Banach defended his thesis (submitted in 1920) in 
which a well structured concept of the norm in abstract vector spaces 
has been laid out. And not only that: Banach also considered 
complete normed vector spaces, which Maurice Fréchet in 1928 
suggested be named Banach spaces. 


7.1 Definition. Let (X,d) be a metric space such that X is a 
vector space over a field F (which is R or C). The metric d is said to 
be: 


a) translation invariant if for all a, 7, ye X, d(w+a,y+a)= 
d(x, y). 


b) homothetic if for all ac€ F and z,yEeX, d(axr,ay) = 
jald(x, y). 
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If a metric d is translation invariant and homothetic we abbreviate it 
by TIH. O 


If d is a metric on a vector space X, then we are able to measure 
the length of vectors, thus comparing them by setting the distance 
from any point x € X to one fixed point of X, the origin. If, in 
addition, d is TIH then we can use the properties of X as a vector 
space, and in some particular cases, employ even geometry, thereby 
emulating the Euclidean space and preserving the generality needed 
in applications. 


7.2 Definition. Let d be a TIH metric on a vector space X, with 
the origin 0, over F (assuming that F is R or C). Then for all x € X, 
we call the distance d(x, @) the norm of vector x and denote it by 
\|a||. We also call || - || the norm on X induced by the TIH metric d. 
The pair (X, || - ||) will be referred to as a normed vector space 
(NVS). O 


7.3 Theorem. Let || - ||be a norm on X in Definition 7.2. Then, 
the following properties of || - || hold true. 


(i) |2l=0 = «c=2@, 


(44) |lax|| = |a| |||], Va e F, Va € X. 
(iat) lz + yll < llell + llyll, Va,y € X. 
( 


iv) The norm || - || is a continuous mapping from (X, || - ||) to 


(Ry,|- 1). 


Proof. Property (2) is obvious. 


(it) |lax|| = d(axz, 6) = d(azx, aé) 
= |a|d(z, #) = |al||z\). 
(iit) |x +yl| =d(w+y, 0) =d@, —y) 


< d(x, 6) + d(@, — y) = ||| 
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+ |= 1 {lyll = [lel + Ilyll. 


(iv) From inequality (1.2c) we have 
d(x, @) — d(z, )| = |llall — ll2lll S$ d(@, 2) = lla — zl]. 7.3) 


Thus, if f(a): =||z|], we have from Definition 4.1 that 
f : (X,|| -|)) — (R,|- |) is continuous at x if for each e > 0, there 
is a 6 > 0 such that | f(x) — f(z)| < e for all z © X with the pro- 
perty that ||~ — z|| < 6. Now, letting « = 6 we have the latter hold 
due to inequality (7.3). O 


7.4 Remark. Conversely, if ||- || is a real-valued nonnegative 
function defined on a vector space X and has properties (7-722) of 
Theorem 7.3, then || - || generates a TIH metric on X by setting 
d(x,y) = || x — y || (show it, see Problem 7.6). O 


7.5 Definition. If d in Definition 7.2 is a TIH pseudometric, then 
the function || - || is called a seminorm and correspondingly, the pair 
(X, || - ||) is called a seminormed vector space (SNVS). 


It is easy to show that the Euclidean metric d, on R” is TIH. The 
associated norm induced by d, is called the Euclidean norm and it is 
denoted || - ||, . 


A very important class of NVS's is introduced below. 


7.6 Definition. An NVS is called a Banach space if it is complete 
with respect to the metric induced by the norm (or the norm induced 
by a TIH metric, as per Remark 7.4). O 


7.7 Examples. 


(i) The NVS (R", || - ||.) over the field R with ||a|| = Led 
| i=l 


is a Banach space with the Euclidean norm (see Problem 7.1). 
(ii) The NVS I? (p > 1) over the field C with the norm |||, 
= [0° |x,|"]!/” is a Banach space. Observe that || - || » indeed 


defines a norm (called the J? norm). (See Problem 7.5.) Now let 
{2} be a Cauchy sequence. Then as a Cauchy sequence it is 
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bounded (see Problem 3.4), say, by some M € R,. Let x = (x, 
az2,...) be the pointwise limit of the sequence {x')}. This limit 
exists, because each x; is the limit of the zth-component sequence in 
(C,d.) which is complete. First we need to show that x is an ele- 
ment of 1? (that is, ||z||,, < 00) and secondly, that 


c) 4g 


(i.e., {a'”)} converges to x in 1? norm). 


We have 


/ 1/p 


: oe lhe (n) a(n) 
[Siew] =| Shee af + of 
k=1 k=1 
(by Minkowski's inequality, Problem 1.7 (a), 
n 1/p n _1/P n 7 1/p 
| > (ai + uy < | Sat] - | tr 
i=l i=1 5 


with a, = xy — xi") and by = x\”) 


Now, letting m — oo, we have 


i 1/p 
bia <M, 
k=1 
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which holds for all r=1,2,... . Hence, with ||z||, <M, we 
showed that x € 1?. 


The proof that x) — «x in 1? norm is left for an exercise. (See 
Problem 7.7.) So, /? is complete and therefore it is a Banach space. 


(iit) Let F(X) be the space of all bounded functions on X 
valued in (R,d.) or (C,d.). Obviously, F, is a vector space. The 
norm ||f||,, = sup{|f(a)|: 2 € X} is called the supremum norm. 
The space (F(X), || - ||.) is then an NVS. The convergence in such 
a space (i.e., In supremum norm) is referred to as the uniform 
convergence. Now, (F;.(X), || - ||,,) is a Banach space (see Problem 
74). 


(iv) Consider Ci.) 48 the space of all n-times differentiable real- 
valued functions on a compact interval [a,b]. It is easily seen that 


Cia b| is a vector space. We introduce the following norm in Cia Dk 


If lls = WF lloo + IF oo #22 + FO Uy 


Clearly, || - ||, is a norm in Ci.)° We show that C7’ ,) is a Banach 


[a,b] 
space under this norm. Let { f;,} be a || - ||.-Cauchy sequence. Then, 
for every € > 0, there is a positive integer N such that Vk, 7 > N, 


fim fellot I-A llot + IF -F2 lo xe, 


which implies 


Ae 


<e,i=0,1,...,n. 
(oe) 


Therefore, by the well-known theorem from calculus (cf. Theorem 
4.2, p. 508, in [Fish]), there exists a function g;: [a,b] — R to 


which the sequence {f (), j=1,2,...} converges uniformly and g; 
j g M4 


is continuous, 7 = 0, 1,...,. On the other hand, it holds that 
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FEM) — FEM) = f FO (wan, 


[a,x] 
Palys PSUS as. 


Let k — oo in the above equation in the sense of the pointwise con- 
vergence. Because Fo and g; are bounded functions, by the 
Lebesgue dominated convergence theorem (see forthcoming Chapter 
6 for a rigorous proof of this theorem), we may interchange the limit 
and the integral and have 


gi-1(z) — gi-i(a) = fog(u)du, i=1,...,n. 
[2,2] 


Consequently, we conclude that g;_; is differentiable on [a,b] and 
g,_, (x) = g;(x). Thus go € Ci) implying that || fi, — golly — 0 


and C”” 


[a,b] is a Banach space. 


(v) The set Q of all rational numbers with the Euclidean norm 
|| - ||, is an NVS, but it obviously fails to be Banach. 


(vi) In Example 1.3(iv) we introduced the vector space Ci, of 
all continuous functions on interval [a,b] C R and defined the 
supremum metric 


d(x,y) = sup{|a(t) — y(t)|: t € [a, bl}, 


which induces the supremum norm 
lal]. = sup{|a(t)|: t € [a, ]}- 


Now, the space (Cj,1j, || - ||,,) is Banach. Indeed, let {x,} be a 
Cauchy sequence. Then, for each ¢ > 0, there is an N(e)-tail of 
{z,}. Thus, for each t € [a, 6], |v»(t) — x, (t)| < €, meaning that 
for each t € [a, b], {v,(t)} is Cauchy in (R, || - ||.). Because the lat- 
ter is complete, we conclude that x,,(t) converges pointwise to some 
function x(t). Consequently, |x,,(t) — x,(t)|<e¢ for all m,n> 
N(e) (with m — oo for each t € [a, b]) implies that 


|x(t) — x,(t)| < ©, holding Vt € [a, 5] 
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and thus 
le || 55 Se 


The latter means that {x,,} converges to x uniformly on [a,b] and 
therefore, from calculus we know that x is continuous. Hence, 
(Cian; Il - Ilo) is Banach. [The result also follows from (iv)]. 


(vii) Let Ri ») denote the space of all real-valued Riemann 


integrable functions on interval [a,b]. Obviously, Ri ») iS a real 
vector space. Introduce for an element x € Ri b) the function 


lel = [?le(@) lat, (7.7) 


which obeys all properties of a norm except that ||z|| = 0 does not 
imply that x(t) = 0 for all t € [a,b]. This is because there are inte- 
grable functions that can differ from x on infinite sets, even un- 
countable sets. (For the reader familiar with analysis, a function y 
can differ from zero function 0 on a continuum set of Lebesgue 
measure zero and it must be almost everywhere continuous. We 
discuss these and other aspects of integration in Euclidean spaces in 
Chapter 6, Section 3.) Thus the function defined in (7.7) is a semi- 


norm on Ri. 
[a,b] 


As we have done for metrics, we can convert || - || to a norm and 
(2, ap || - ) to an NVS by turning to the quotient NVS space 
a ply Il - |) , where || - || (notice the bold face of || - ||) is the 
restrictor of || - ||. Obviously, an element of Re olEl will be an 


equivalence class [a] of all integrable functions that are equal 
“almost everywhere on [a, b].” O 


The following property of NVS's asserts that the summation of 
their elements and multiplication by a scalar are two continuous 
functions with respect to the norm. This property makes NLS's an 
important subclass of “topological vector spaces.” 


7.8 Proposition. Let X be an NVS. Then, the functions 
(i) p:X xX — X defined as y(x,y)=x+y 
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(ii) w:F xX — X defined as (a,x) = ax 
are continuous. 


Proof. (7) By Theorem 4.4, y is continuous if and only if for 
every convergent sequence (2,,Yn) (x,y), the sequence 
p(2n; Yn) > v(x, y), and this obviously is the case, because 


Il (Zn ar Un) = (x si y)|| 
= [Zn — 2|| + ll¥n — yl — 0 asn— oo. 
The proof of (77) is similar. oO 


We close this section with the theorem playing a major role in the 
next topic on finite-dimensional NVS's. 


7.9 Theorem. Given a linearly independent set {x1,...,%»} C 
(X, || - ||), there is a positive constant C' such that, for any n-tuple 
of scalars Q1,...,Qn, € F (Ror C), the following inequality holds. 


sop Hell > CIE lal. (7.9) 

Proof. Note that (7.7) holds trivially if }*"_,|a;| = 0. We thus 

assume that at least one a; 4 0. Then, with G;: = a;/>>*_,|a;|, we 
have 

p21 Feil] = C, such that S77 4|G:| = 1, (7.9a) 


which is equivalent to (7.9) if at least one of (;'s is not zero. That is, 
we need to show that given a linearly independent set of vectors 
L1,-+--+,;Xn, there is a positive constant C’ such that for any an n-tuple 
of scalars G;,..., 8, with >>" ,|G;| = 1 the inequality 


i= Bieil] > C 


holds. Assume the opposite, that is, for any n-tuple of linearly 
independent vectors 21,...,%,, any n-tuple of scalars (;,..., Bn 
with )>*"_,|G;| = 1, and any positive constant C, 


oi Giaill < C. 
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Therefore, for any k = 1,2,..., there are n-tuples of linearly in- 
dependent vectors x,,...,2,, and an n-tuple of scalars a, shang io ) 
with S77, | ("| = 1 such that 


[aca 


1 
< E. 


Then, 


—-Oak—- ow. 


ona 


Now, the sequence { a, Ber ke) :kEN } is obviously bounded. 
Thus, by Bolzano-Weierstrass theorem, the sequence { a :kEN } 
has a convergent subsequence vee :steN } whose limit we 
denote by 6,. Then, from the subsequence ae :si' EN } there is 


ko buen 
a convergent subsequence {3 Di ge n} whose limit is 69. 


Continuing sifting every next subsequence we finally arrive at the 
subsequences 


nae SIS n} = 1 OigeeesOr}s 
such that 


— 0 ass” — oo 


n kin 
[panes ei 


and 


i 


ae) = 


and thus by continuity of the Euclidean norm, 
116 | = 1. (7.9b) 


On the other hand, by continuity of norm || - || (Theorem 7.3(iv)), 
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n Kgn n 
rea al] S IL Sxl = 0. 


Thus )>;_,6;2; = 0. Because by (7.9b) not all 6;'s are zeros, 21,..., 
x», must be linearly dependent, which is a contradiction. O 


PROBLEMS 


7.1. Show that (R”, || - ||.) defined in Example 7.7 (i) is an NVS 
and then show that it is a Banach space. 


7.2 Define the space 1° as the set of all bounded sequences x = 
{x1,2,...} CC. Show that I° is an NVS with the norm 
defined as ||z'|| = sup{|a;|: 7 = 1,2,...}. 


7.3 Define the space c C 1° as the subset of all convergent sub- 
sequences and let cy C c be the set of all sequences convergent 
to zero. Show that c and cg are normed vector subspaces of /°° 
with the same norm as that in Problem 7.2. 


7.4 Let F,(Q) be the space of all bounded real-valued functions on 
Q. Show that F, is a vector space. Let ||f||,, = sup{|f(w) 
w € Q} be the supremum norm defined in Example 7.7(cit). 
Show that the supremum norm in F, is indeed a norm and 
show that F,, is a Banach space with respect to this norm. 


7.5 Show that || - ||,, in Example 7.7(72) is a norm. 


7.6 Let ||- || be a real-valued nonnegative function defined on a 
vector space X over a field F (which is R or C) and let it have 
properties (i- 777) of Theorem 7.3. Show that || - || generates a 
TIH metric on X by d(x, y) = ||x — y|]. 


7.7 Show that the pointwise limit x of the sequence {a} in 
Example 7.7(i7) is also an /?-limit. 
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8. FINITE-DIMENSIONAL NORMED VECTOR SPACES 


The notion of a finite-dimensional normed linear space was intro- 
duced by the German mathematician Hermann Minkowski (1864 - 
1909) in his monograph Geometrie der Zahlen (Teubner, Leipzig, 
1896). To honor his contribution, such spaces are often called 
Minkowski spaces. 


We begin this section with the following significant property of 
finite-dimensional NVS's: 


8.1 Theorem. Any finite-dimensional NVS is a Banach space. 


Proof. Let n be the dimension of an NVS X and let {b;,...,6n} 
be a Hamel basis for X. Let {x;,} C X be a Cauchy sequence. Then, 
every element of the sequence can uniquely be expressed as 


cp = o_o, (8.1) 


Because {x;,} is Cauchy, for every « > 0, there is an N(e)-tail 
{tn,£@n+41,---} of {xz} such that ||v, —2m|| < © for all elements 
of the N(e)-tail. Since b;,...,b, are linearly independent, due to 
Theorem 7.7, there is a universal constant C' > 0 such that 


ee — aml] = [Es [at — af], 


n (k) 
ae — a; 


good for all k, m > N(e). 


Therefore, for all k and m greater than or equal to N(e), 


(k) __ (m) 
a; 


(m) 


aoe <e/C. 


Consequently, i ae: is a Cauchy sequence in (F, | - |) and the latter 


is complete (whether F is R or C). If a; is the limit of 
al) i=1,...,n, then the element x: = 5>;"_,a;b; belongs to X 


and we have by the triangle inequality: 
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k 
"for — af” |. 


|e = axl] = |] OR fai - af], 


For each 2, a”? 


Tan > 9, there is an NN; (air Ta ) tail of {a(t} such that for all 


is convergent to a;. Therefore, for each choice of 
terms of the N; (air =) -tail, the inequality la — al”| a Ten holds. 
If N = max{M,,..., N,}, then 


ja = eel] < Eales — a Bil] < © (8.1a) 


for all k > N, that is, for each c, there is an N(e)-tail of {x;,} with 
property (8.1a). Thus, the sequence {2;,} is convergent. O 


Combining Theorems 8.1 and 5.1 we arrive at the following. 


8.2 Corollary. Any finite-dimensional subspace of a normed 
vector subspace is a closed subspace of this space. O 


According to Theorem 6.7, every compact subset of a metric 
space is closed and bounded. The converse of this theorem was part 
two of Theorem 6.8 (Heine - Borel) holding for the Euclidean space. 
Beyond the Heine-Borel theorem, we learned that this converse did 
not hold in discrete metric spaces. It stands to reason to investigate in 
which spaces the converse of Theorem 6.7 is valid. 


8.3 Definition. We say that an NVS (or metric space) has the 
Heine - Borel property if every closed and bounded subset of the 
space is compact. O 


8.4 Theorem. Every finite-dimensional NVS space has the Heine 
- Borel property. 


Proof. Suppose A is a closed and bounded subset of an n- 
dimensional NVS X. Let {a;,} C A be a sequence and {b),...,b,} 
be a Hamel basis for _X. Then, 


oy = Yr, ab)d;, k =1,2,... (8.3) 
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Because A is bounded, the sequence is also bounded, that is, there is 
a constant M such that ||2;,|| <M for each k. Due to Theorem 7.7, 
there is a constant C’ > 0 such that 


lexl] = [oLaa! 


al”, (8.3a) 


k 


Inequality (8.3a) yields ; f < M,i=1,...,n. By the Bolzano - 


Weierstrass theorem, {a{"} has a convergent subsequence for each 


k. Repeating the same process as in the proof of Theorem 7.7, we 


can extract from the sequence {a\", ba a) KEN } a convergent 


subsequence jor ae. whose limit we denote by 


{61,...,6n}. Consequently, by (8.3), the sequence {x;,,} has a sub- 
sequence {2} that can easily be shown to converge to point 


= yey E A, 


in as much as A is closed. In other words, A is sequentially compact. 
Therefore, by Theorem 6.3, A is compact. O 


The following theorem by Frigyes Riesz sheds more light on the 
Heine - Borel property for NVS's, where we establish a necessary and 
sufficient condition for it. 


8.5 Theorem (F. Riesz). An NVS (X, || - ||) Aas the Heine - Borel 
property if and only if it is finite-dimensional. 


To prove this theorem we need the following. 


8.6 Lemma. Suppose (X, || - ||) is an infinite-dimensional NVS. 
Then, there is a linearly independent sequence {x,,} C S(0,1) 
(where S(@,1) is the unit sphere centered at zero) such that for each 
pair x;, x, of distinct elements, ||x; — x,|| > 1. 


Proof. We use the induction principle to prove this statement. 
Assume that for n > 1, there is an n-tuple {z1,...,2,} of linearly 
independent elements all belonging to the unit sphere 0B(0,1) such 
that |v; — x,|| > 1. Denote A = span{xj,...,2,}. Let z € A® and 
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let 6 = d(z, A°). Because, by Corollary 8.2, A is closed, A° is open 
and thus there is a ball B(z,1r) C A‘ implying that 6 > 0. 


Now, since A is a subspace, it follows that +A = {42:2 € A} 
= A. Denote z = z/6. Because the metric induced by the norm is 
homothetic, we have 


1 = $d(z, A) = d(z, A). 
From the latter it follows that there is a sequence {y,,} C A such that 
llyn — 2o|| > 1. 
(Indeed, 1 = d(z, A) = inf{||zo — z|| : c € A} means that for each 
€ > 0, there is an x € A such that 1 < ||zp — z|| < 1+.) 


Now, we have 
l[Ynll < [lyn — zoll + |l2ol| < R + |lzo|] = :M. 


Therefore, {y,} C C(@0,M)N A, which is a closed and bounded 
subset of A. Because A has the Heine-Borel property, C(0, MM) A 
is compact and thus by Theorem 6.3, {y,} has a convergent sub- 
sequence {y,,, } that has a limit point, say y, in A. Because ||y,, — zo]| 
— 1, we must also have ||y,,, — 2o|| — 1 and thus 


1 = d(z, A) < lly — 2oll 
S [ly — Ynell + lyn. — 20l] = O+1 


implying that ||y — zo|| = 1. 


Define 41 : = y — 2%. We show that x,,, fits the pattern of the 
elements from {71,....,2}. 


(i) Obviously, tn41 € S(8,1). 

(i7) If ani is linearly dependent with {21,...,2,}, then 
1 %2i = 2% — y.Because ye A,clearly y= )7),G;0; and 
hence zo = >>;_,(a; + G;)a; and thus must belong to A. So, it 
implies the linear independence of the tuple {x,..., @,41}. 
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(iiz) Finally, we show that ||”; — &n41|| > 1. In as much as A is a 
subspace, A = A — y ify € A and thus 


1 = d(Zo, A) = d (Zo —Y; A y) = d(Sn41; A) 
< d(Sn41; x) — l2n41 = ||, for each x € A. 
In particular, it holds true for x; € {x1,...,%,} C A. 


Thus, if we continue this process, we indeed collect a sequence 
{x,,} as the lemma claims. O 


Now we are back to the proof of the theorem. 


Proof of Theorem 8.5. Suppose that X is infinitely dimensional 
and let S(0,1) be the unit sphere in X. Obviously, S(6,1) is 
bounded. Furthermore, S(0,1) = C(@,1)\B(6,1), where C(6,1) is 
the unit closed ball, and thus $(6,1) = C(0,1)M B°(6,1) is the 
intersection of two closed subsets. By Lemma 8.6, there is a se- 
quence {z,,} C S(0,1), which due to the property ||x,, — x,|| > 1, 
is not convergent, nor can it have a convergent subsequence. Thus, 
by Theorem 6.3, S(6,1) is not compact. Therefore, X fails to have 


the Heine - Borel property. O 
8.7 Definition. Two norms || - ||, and || - ||, on a vector space X 
are called equivalent or similar (in notation, || - ||, ~ || - ||,) if there 


are two positive constants kK < M such that for any x € X, 
Kall, < llell, < Mell. a 
We show that on a finite-dimensional space, all norms are equiva- 


lent. 


8.8 Proposition. The similarity in Definition 8.7 defines an 
equivalence relation on the set of all norms on X. O 


(See Problem 8.1.) 


8.9 Proposition. Jf || - ||, and || - ||, are two equivalent norms on 
a vector space X and if {x,,} is a sequence in X, then 


Il lle 


(4) 2, Neh SX, OS 2. 
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(ii) {a,} is Cauchy in || - ||, if and only if {z,,} is Cauchy in 
II - Ilo. 


Proof. Because || - ||, ~ || - ||, there is a positive constant M 
such that 


Ilzll, < Mla]. (8.9) 


(i) Leb a, ae Then, for any ¢ > 0, there is an N(e) tail 


{tn,2v+i,---} of {x} such that ||2—<2,||<e¢/M for all 
In € {Zn,LN41,.-- }. From (8.9) we have 


2 — Lally < M||z — ay||) << Me/M =e. 
Obviously, the roles of the norms can be interchanged. 
(ii) If {x,} is || - ||,-Cauchy, then for each c > 0 there is an 
N(e) tail {ay,rn41,...} of {x,} such that ||a, — x,|| < ¢/M for 


all @m,2n € {@n, LN+1,--- }. From (8.9) we have 


2m — Ln\lp < Mam —2nl|, << Me/M =e. 


Obviously, the roles of the norms can be interchanged. O 

8.10 Corollary. /f || - ||, and || - ||, are two equivalent norms on a 
vector space X, then (X, || - ||,) is Banach if and only if (X, || - ||,) 
is Banach. O 


8.11 Example. In n-dimensional Euclidean space the norms 
Ize = Vite + eR 
and 
Fale = max{|21|, ae) [ae fs 


where x = (11,...,%)', are equivalent. Indeed, from 
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2 
(alee Pre, 
we have 
Illes > Fell Ue 


On the other hand, obviously, || - ||,, < || - ||. Thus, with the choice 


of K = gy and M = 1 we have 


Kll- lle <M too SMI Mle: U 


Example 8.11 can be generalized in 


8.12 Theorem. /n a finite-dimensional vector space all norms are 
equivalent. 


Proof. Let {b;,..., bn} be a Hamel basis for X and let || - ||, and 
|| - ||, be any two norms on X. Then, if x € X and thus is 
representable as x = )>,"_,a;b;, by Theorem 7.9, there is a positive 
constant C’; such that 


lel, = oeveebill, = Crea lail. 
Let nes = ma |b; ||524 = 1a, }. Then, 


Ilz|l5 = D-lail|lbill = mys. ai < M||2\I,, 


where M = ¢-. Analogously, 


1 
lItl, <x 


X|\5- O 


8.13 Definition. Let X be an n-dimensional vector space over F 
(which is R or C) with a Hamel basis {b,, ...,b,}.For anyx € X 
such that x = }>;"_,aib;, a; € F, define 
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zl, = f Sealed. (8.13) 


According to Problem 8.2, this is a norm on (X,C), called the 
standard norm. O 


Notice that the standard norm is also a norm on F”. 
8.14 Remarks. 


(i) Proposition 8.9(2) reconciles the notions of equivalence bet- 
ween two norms || - ||, and || - ||, and two metrics d; and d2 induced 
by || - ||, and || - ||, (see Remark 7.4) in any NVS. Indeed, according 
to Remark 4.9, any two equivalent norms generate the same topology 
in an NVS, i.e. 7(|| - ||,) = 7(|| - 2). If in addition, an NVS is finite- 
dimensional, then by Theorem 8.12, any two norms induce the same 
topology. Furthermore, any norm in a finite-dimensional NVS 
induces the same topology as that of the standard norm. 


(ii) The standard norm in (8.13) clearly resembles the n-dimen- 
sional Euclidean norm, which according to (i) induces a unique 
topology on (R, || - ||), with respect to any norm || - ||, called the 
usual topology, in notation (R, 7). O 


PROBLEMS 


8.1 Prove Proposition 8.8. The similarity in Definition 8.7 defines 
an equivalence relation on the set of all norms on X. 


8.2. Prove that the function || - ||, in Definition 8.13 is a norm. 


8.3. Define the function f : F” — R, as 


f(a1,.-+, Qn) = [base a,b; ||. 


Prove that f is continuous with respect to || - ||. 


168 CHAPTER 2. ANALYSIS OF METRIC SPACES 


Problems 8.4-8.6 offer an alternative proof to Theorem 8.12 by 
showing that any norm || - || on a finite-dimensional NVS X (which 
is assumed throughout) is equivalent to the standard norm || - ||,. 


8.4 Let (X, || - ||) be a finite-dimensional NVS and || - ||, be the 
standard norm defined in (8.13). Show that there is a positive 
constant / such that for each x € X, 


||z|| < Mlle, 


8.5 Let (X, || - ||) be a finite-dimensional NVS. Let S(0, 1) denote 

the unit sphere in F” centered at zero and f be the function on 
F” defined in Problem 8.3. Show that f reaches its maximum 
and minimum values in R, on S(0, 1). 


8.6 Let (X,]|| - ||) be a finite-dimensional NLS. Show that there is 
a positive K constant such that 


Kll- lle <ll- Ib 
=e k 
8.7 Let || - ||,,---, || - ||, be norms in R” and let || - || = >> 7_,|| - |l,- 
Show that ||- || is a norm in R” and that || - || induces a 


topology, identical to the usual topology. 


8.8 Let Y be a finite-dimensional subspace of an NVS (X, || - ||). 
Suppose there is a bounded sequence {y,,} C Y. Show that 
this sequence has a subsequence convergent in Y. (This gene- 
ralizes a known result from calculus that any bounded se- 
quence in IR” has a convergent subsequence. ) 
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Minkowski, Herman, 160 

Minkowski spaces, 160 

Heine -Borel property, 161 

Riesz, Frigyes, 162 

Riesz criterion of Heine - Borel property, 162 
equivalent (similar) norms, 164 

similar (equivalent) norms, 164 

||x||,. standard norm, 167 

(R, 7), usual topology, 167 


Chapter 3 
Elements of Point-Set Topology 


1. TOPOLOGICAL SPACES 


In Definition 4.5, Chapter 2, we called the collection of all open sets 
T(d) of a metric space (X,d) the topology induced by a metric. We 
recall that this collection of open sets or topology is closed with 
respect to the formation of arbitrary unions and finite intersections. 
We understand that the topology of a metric space carries the main 
information about its structural fingerprint. For instance, equivalent 
metrics possess the same topology. In addition, through the topology 
we could establish the continuity of a function (see Theorem 4.6, 
Chapter 2) without need of a metric. This all leads to an idea of 
defining a structure more general than distance on a set, a structure 
that preserves convergence and continuity. 


Mathematics historians are not in complete agreement about the 
roots of topology and who should get full credit for being its initia- 
tor. Most consider that topology, as the theory of structures, has its 
basis in the work of the German mathematician Felix Hausdorff, 
who published his fundamental monograph, Grundzuge der Mengen- 
lehre (Principles of Set Theory), in Leipzig in 1914. It was “imme- 
diately” preceded by Maurice Fréchet's 1906 pioneering introduction 
to metric spaces. (Notice that contemporary topology has branched 
out into several specialized areas, such as general topology, algebra- 
ic topology, and combinatorial topology. The very topology founded 
by Hausdorff was what we now refer to as general topology, also 
called point-set topology, which is deeply bound to classical analy- 
sis.) Nicolas Bourbaki [Bou2], regarded German Bernhard Georg 
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Riemann's work (his doctoral and habilitation theses and a paper on 
Abelian functions) from 1851 to 1857 as revolutionary and qualified 
him as the creator of topology, because he was the first to recognize 
where topological ideas were needed. In 1870, Georg Cantor (appar- 
ently inspired by Riemann's work), in connection with the represen- 
tation of real-valued functions by Fourier series, was concerned with 
the characterization of sets on which the function's value can be 
altered leaving the series invariant. This yielded more advanced con- 
cepts of topological accumulation point (introduced earlier by Karl 
Weierstrass), derived set, closed set, connected set, dense set and 
others that further led to the topological big bang. The very word to- 
pology was introduced for the first time in 1836 by the German 
Johann B. Listing, who used this as the notion of a “new analysis.” 


Topology has evolved further ever since. Most of the fundament- 
al results in general topology were developed in works by Germans 
Felix Hausdorff, Heinz Hopf, and Hermann Weyl, Russians Pavel 
Alexandrov and Pavel Urysohn, Poles Stefan Banach, Kazimierz 
Kuratowski, and Waclaw Sierpinski, American Eliakim H. Moore 
and James Alexander, and an influential Bourbaki group of mostly 
French mathematicians. (Nicolas Bourbaki is a pseudonym chosen 
by a group of young mathematicians in France in 1935 to represent 
the essence of a “contemporary mathematician.” The founding 
members were all connected to the Ecole Normale Supérieure in 
Paris and included Henri Cartan, Claude Chevalley, Jean Coulomb, 
Jean Delsarte, Jean Dieudonné, Charles Ehresmann, René de Possel, 
Szolem Mandelbrojt and André Weil. After World War II they were 
joined by the Polish - American Samuel Eilenberg. ) 


1.1 Definition. Let X #4. A collection 7 of subsets of X is 
called a topology on X or a family of open sets, if: 


(i) X,OEr. 
(i) {O;:¢4E IT} Cr => JO; er. 
iel 
(iii) Tis M -stable, that is,O,,O02 € T => ON Oe ET. 
[Observe that property (277) implies inductively that the inter- 
section of any finite collection of open subsets will also be open.] A 
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carrier X endowed with a topology 7 is said to be a topological 
space. The topological space is denoted by (X,7T). 


1.2 Examples. 


(i) Let (X,d) be a metric space and let r(d) be the topology 
generated by the metric d (see Definition 4.5, Chapter 2). Due to 
Theorem 2.5, Chapter 2, the collection of all open sets generated by 
metric d contains all arbitrary unions and finite intersections. More- 
over, @ and X are also open, so that 7(d) is indeed a topology as 
defined above. We call it a metric topology. For instance, the topol- 
ogy in R” generated by the Euclidean metric d, was called the usual 
(or standard or natural) topology and it is denoted by 7 [see 
Remark 8.14 (ii), Chapter 2]. 


(it) Let X be a nonempty set. Then the pair {X, O} = 7 is a 
trivial example of a topology. It is obviously the smallest topology 
on X, and it is called the indiscrete topology. Another trivial 
example of a topology is P(X), the collection of all subsets of X. 
This is the largest possible topology on X, and it is called the 
discrete topology. 


(iti) For A C X, 7 = {X,@, A} is a topology “induced by set 
A.” 


(iv) Let X = R= RU {—co} U {+00} be the extended real 
line. Let 7. C P(X) be the following collection of sets. 


O € 7, if and only if 
1)ONRE%X. 


2) oc € O or —co € O, then there is an a € R such that 
(a, co] C O or ana € R such that [—oo, a) C O, respecti- 
vely. 


Then 7, is a topology on R (see Problem 1.1) referred to as the 
extended real line topology. 


(v) Let (X,7) bea topological space and let Y C X. Define 
the system of subsets ty = {OM Y: O € T}. We show that ty is a 
topology on Y. Indeed, Y and @ obviously belong to ty. Let {U; : 
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i € I} C ty. Then, Vi € TI, there is O; € 7 such thatO; NY = U; € 
ty. Now |J O; €7 and therefore, YN O; € ty. On the other 


tel tel 
hand, due to the distributive law, 


YoUO;=VUVN0;) =UGen. 


ie. el iel 


It can similarly be shown that ty is closed with respect to the 
formation of all finite intersections. Therefore, Ty is a topology on 
Y C X, called the relative topology of tT on Y. The pair (Y, Ty) is 
called a subspace. For instance, take the Euclidean metric space 
(IR, d.) and let Y = [0, 1]. Then the set (5, 1] isopenin(Y,7y). O 


1.3 Remarks. 


(i) Let X be a nonempty set and let 7 and 7’ be two topologies 
on X.If 7 C7’, then we say 7 is weaker (or smaller or coarser) 
than Tt’. We also say that 7’ is stronger (or larger or finer) than T. As 
follows from Examples 1.2(ii) and (iii), tT C 7 C P(X). The in- 
discrete topology is, therefore, the coarsest topology on X, whereas 
P(X) is the finest topology on X. 


(it) If (X,d) is a metric space and 7(d) is the topology induced 
by d (also called the metric topology), then (X,7(d)) is said to be a 
metrizable (topological) space. Therefore, a metrizable space is a 
topological space with a topology that comes from some metric. UO 


1.4 Definition. Let (X,7) be a topological space. A subset 
AC X is called r-closed or just closed if A° € r. We symbolically 
denote 7° the collection of all 7-closed subsets of X. Of course, 7° is 
not the complement of 7 with respect to P(X). O 


As in the case of metric spaces, we can easily prove that X and @ 
are closed, finite unions of closed sets are closed, and arbitrary 
intersections of closed sets are closed. 

In Definitions 1.5 below we introduce some important notions for 
topological spaces. It is beneficial to support these definitions by 
examples immediately following the introduced notions. To refe- 
rence the examples, we assign them letter D followed by the prefix 
of the definition. 


1. Topological Spaces 175 


1.5 Definitions. 


(i) Let (X,7) be a topological space. A subset A C_X is called 
a neighborhood of a point x € X if x belongs to some open subset of 
A. Specifically, if A € 7 then A is called an open neighborhood of x. 


[Example D1.5(i). Let X = R and r = {R, @, {1}, (3, 4], {1} U 
(3, 4]}. Then {1} is an open neighborhood of 1. Because [3,5] 
contains (3, 4], it is a neighborhood of 34, (— 2,0) is not a neigh- 
borhood of — 1, since there is no open neighborhood that contains 
— 1 other than R.] 


nS 


(it) A point x is called an interior point of a set A if A is a 
neighborhood of x. [In the above case, — 1 is not an interior point of 
( — 2,0).] The set of all points interior to A is called the interior of 


A and is denoted by Aor by Int(A). 


[Example D1.5(it). In Example D1.5(i), 1 is the interior point of 
the set {1}. The interior of set A = [3,5] is A = (3, 4].] 


(iit) The collection of all neighborhoods of a point x € X is 
called the neighborhood system at x and it is denoted by U,. (The 
letter U¢ stems from the German word Umgebung which means 
neighborhood.) An arbitrary subcollection B, CU, is called a 
neighborhood base at x (or a fundamental system of neighborhoods 
of x), if every neighborhood U, € U, is a superset to least one 
B, € B,. Any element B, € B, is called a base neighborhood. 
Clearly, 2/,. itself is a neighborhood base at x. Obviously, B, is a 
neighborhood base at x if and only if there is another neighborhood 
base D,. such that every base neighborhood D,. € D, is a superset to 
at least one base neighborhood B,, from B,. 

[Example D1.5(iii). Let {B(z,+), n=1,2,...} be the 
sequence of d,-open balls centered at a point x € R”. Clearly, it is a 
base neighborhood at x. Another neighborhood base at x, which 
contains the above neighborhood base, is the system of all open balls 
with rational radii, centered at 7. We can also take the system of all 
open balls with positive real radii, centered at x. This system 
contains the first two neighborhood bases. } O 
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A neighborhood base 6, at x is in general a more “economical 
system” of neighborhoods than the whole neighborhood system U/, ; 
and, as it will be shown, it is as informative about the structure of the 
space in the vicinity of x as U/, is. Technically, it is of greater 
advantage in various proofs to use a base neighborhood than to use 
an arbitrary neighborhood. 

As follows from the definition, an arbitrary set A need not be a 
neighborhood of all of its points. For instance, [0,1] is not a neigh- 
borhood for points 0 and 1 in the usual topology (IR, 7-). More about 
the nature of neighborhoods is contained in the following proposi- 
tions that the reader can easily verify. 


1.6 Proposition. A C X is a neighborhood for all of its points if 
and only if A is open. O 


(See Problem 1.4.) 


1.7 Proposition. A is the largest open set contained in A. 
Furthermore, A= A if and only if A is open. O 


(See Problems 1.5.) 


In particular, it follows that A is open if and only if A = A 
1.8 Definitions. 


(i) a € X is called a closure (or adherent) point for a set A if 
any neighborhood of x has a nonempty intersection with A. We also 
say that any neighborhood of x meets A. The set of all closure points 
of A is called the closure of A and it is denoted by A. (Sometimes, 
when working with relative topologies it is necessary to emphasize 
that the closure of A is with respect to the carrier X, and it is more 
reasonable to use the notation Cly A. However, for brevity we still 
use the notation A, whenever X is the only carrier in mind.) 


[Example D1.8(i). In the topology introduced in Example 
D1.5(i), let us take A = ( — 2,0). Then we have 


A = (—o0, 1) U(1,3] U (4, 00), (1.8) 
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and A‘= Q. Indeed, for any x belonging to sets (—oo, 1), (1,3) or 
(4,00), R is the only neighborhood of x; thus RN(— 2,0) 4@. 
Observe that 1 is not a closure point of A, since {1} is a neighbor- 
hood (of 1) such that {1} N A = ©. For set B = { — 1} we have 


B = (—o0, 1) U(1,3] U (4,0) = A. ] 


(ii) A subset A C X is said to be dense in X if A= X.AC X 


is said to be nowhere dense if Int(A) = ©. 


[Example D1.8(ii). Consider Example D1.8(i). For the set A = 
( — 2,0), from (1.8), we have 


(A)* = {1} U (3, 4], 


and Jnt(A) = @ (because, as mentioned above, R is the only neigh- 
borhood for all of its points), i.e., A is nowhere dense. The set 


C = {-—1}U {1} U (3, 4] is dense in X.] 


(iit) A point x € X is called an accumulation point (or cluster 
point) of a set A if every neighborhood of x contains at least one 
point of A other than x. The set of all accumulation points is called 
the derived set and is denoted by A’. Directly from the definitions of 
closure and accumulation points it follows that A’ C A. 


[Example D1.8(iii). In Example D1.8(i), A’ = A.] 


(iv) A point x € X is called a boundary point of a set A if every 
neighborhood of x contains at least one point of A and at least one 
point of A°. Analogous to metric spaces, the set of all boundary 
points of A is 


OA= ANAS 
and is called the boundary of A. 


[Example D1.8(iv). In Example D1.8(i), A= @ and 0A = Al] 


(The closure of A is evidently the smallest closed set containing 


A; and A is closed if and only if A = A. See Problem 1.6.) 
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(v) A topological space (X,7) is called separable if there 
exists at most a countable dense subset of X. O 


1.9 Proposition. The closure of a set A is the smallest closed set 
containing A. Furthermore, A is closed if and only if A = A. 


(See Problem 1.6.) 


Convergence of sequences introduced below generalizes that of 
Section 3, Chapter 2, for metric spaces, and it is preparatory for the 
more general type of convergence of nets and filters treated in 
[Dsh2]. 


1.10 Definition. Let {x,: n = 1,2,...} C (X,7) be a sequence 
and let A be a set. A subsequence Qn = {a,: n= N,N +1,...} 
is called an N(A)-tail of {x,,}, for some N > 1, if Qy C A. A se- 
quence {z,,:n=1,2,...} C X is said to converge to a point x € 
X if for every neighborhood U, of x, there is an N(U,,.)-tail of {2 }. 
The point « is said to be a limit point of the sequence. 


Notice that in the literature on topology, the notions of limit and 
accumulation points of a set are synonymous. We do not use both of 
them in the same context and invent a new term for the benefits of 
forthcoming constructions. A point x is said to be a sequential limit 
point of a set A if x is a limit point of some sequence {z,} C A. O 


Unlike metric spaces, a sequence in a topological space can have 
more than one limit as we learn from the following example. 


1.11 Examples. 


(4) Let X=R, let r= {B,O,( — 2,3], | —1,2]}, and let =, 
=+, n=1,2,... . Then, {4} converges to all points of the set 
[— 1,2], because for each point x € [— 1,2], its open neighbor- 
hoods are R, (— 2,3], and [ — 1, 2], each one of which contains the 
whole sequence. Furthermore, for any point x € ( — 2, 3]°, R is the 
only neighborhood of x and R contains the entire sequence. Thus, 


{+} converges to all points in R. oO 


(ii) Let (X,7) be a so-called cocountable (or countable comple- 
ment) topology defined as 7: = {X,@,C*: C is at most count- 
able} introduced in Problem 1.9. We assume that X = R. Consider 
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the sequence ae 0, 5 0, i Oi des , which converges to O in the Eu- 
clidean metric or usual topology. Not so in the cocountable topology. 
The complement of the subsequence {+} is obviously an open 
neighborhood of 0 that contains zero and does not contain a single 
element of {+}. Thus, O, = {+}° does not contain any tail of the 


sequence. 


Using the same argument we can show that the sequence does 
not converge at all. Indeed, if x were a limit of the sequence other 
than O or any other term of the sequence, then {1, 0, $5 0, 7 OD de \ 
would be an open neighborhood of x that does not include a single 
term of the sequence and thus cannot be its limit. O 


Very important classes of topological spaces we make use of are 
introduced in the following. 


1.12 Definitions. 


(i) A topological space (X,7) is said to satisfy the first axiom 
of countability (or to be first countable), if each point « € X has an 
at most countable neighborhood base. 


(ii) A topological space (X,7) is said to satisfy the second 
axiom of countability (or to be second countable), if (X,7) has an at 
most countable base. 


1.13 Proposition. Let (X,7) be a first countable topological 
space and x € X. Then, there is a sequence {x,} C X convergent 
to x. 

Proof. If U/,, = {U;, U2,... } is an at most countable neighbor- 
hood base at x, we can form B, = Uj and B, = () Ui,n = 1,2,.... 

i=1 
Then, 6, = {B,, Bo,...} is clearly a monotone nonincreasing 
neighborhood base at x. Each B, is nonempty and it contains x. 
Using the axiom of choice, there is a sequence {z,,} such that 
XL» € B,. Now, this sequence obviously converges to x by Problem 
1.23. This convergence is with respect to B,, but it clearly is valid 
with respect to any other neighborhood system. O 
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1.14 Proposition. Suppose (X,T) is a first countable topological 
space and A C X. Then, x € A, if and only if there is a sequence 
{t,}C A such that x, — «x. In other words, the closure of a set A 
contains all of its sequential limit points or equivalently, every 
closure point of A is a sequential limit point of A. 


Proof. 


(i) Let x € A and let B, be a monotone nonincreasing countable 
neighborhood base (as per the construction in Proposition 1.13). By 
the definition of a closure point, each B,, € B, meets A. Thus, from 
AN B,, we can choose an x,, thereby forming a sequence {x,,} C A. 
As in Proposition 1.13, x, — 2. 


(it) Conversely, if there is a sequence {x,} CA such that 
XL» — x, for each neighborhood U,, there is a tail of the sequence in 
U, and therefore U,, meets A (through this tail). Consequently, x is a 
closure point. ] 


1.15 Remarks. 


(i) Notice that in part (27) of Proposition 1.14 we did not utilize 
the first countability of the space and yet were able to show that each 
limit point is a closure point. The converse of this (meaning that each 
closure point is a limit point) does require the first countability as we 
learn it from Example 1.16 below. 


(it) In metric spaces (see Corollary 3.4, Chapter 2), a point is a 
closure point of a set A if and only if it is a sequential limit point of 
A. This does not apply to topological spaces. More specifically, a se- 
quential limit point is always a closure point as in Proposition 1.4 
(iz), but the converse is not true. Let x be a closure point of A. If x 

€ A, then setting x, = x, we have a sequence convergent to «x. If x 
¢ A then for each neighborhood U, of x, as a closure point, U;, 9 A 
# @. In this case, it is not clear how to form a sequence convergent 
to x, that is, how to ensure that for each U,,, there is an N(U,,)-tail, 
for we do not have the flexibility of metric spaces with balls like 
Ba, +) of Theorem 3.3, Chapter 2. In Example 1.16 we demonstrate 
a case of a topology where a set A contains all of its sequential limit 
points and yet is not closed, or, in other words, some closure points 
of A are not its sequential limit points. ial 


1. Topological Spaces 181 


1.16 Examples. 


(i) Let (X,7) be a cocountable topology introduced earlier in 
Example 1.11(2i) ast: = {X,@,C*° : C is at most countable} on an 
infinite set X. We assume that X is uncountable. Let {zx,,} be a 
sequence with infinitely many distinct terms, that is, as a set 
{x,,} *N. Thus, there exists a subsequence {z,,} with all distinct 
terms. Suppose there is a limit x of {x,,}. Take O, = {xn,}° U {a}. 
Then, O,, is an open neighborhood of x that does not contain any tail 
of {x,,}. Therefore, any such sequence cannot have a limit. 


(it) Now, under the condition of (7), let {x,} be a sequence 
with all but finitely many terms identical. Thus, for some N, xy = 
LNi, =... = :a. Consequently, any open neighborhood O, 
contains the N-tail of {x,,} and thus x, — x. 


It is easy to verify that any sequence in the cocountable topology 
can have at most one limit. 


(iit) Consider the cocountable topology 7 on R introduced in 
Example 1.11 (i). Take A = (a,b) where a < b. Let {x;} C A be 
any sequence. Then, by the definition of 7, the complement of {2; } 
is open (and disjoint from {x;}). If this sequence has a limit x € A‘, 
then this limit should belong to the open set O, = {x;}° (because 
{an} CA = A® C {a,}°), so that O, can serve as an open neigh- 
borhood of z. 


However, this neighborhood does not have a single element of 
the sequence and, therefore, x cannot be its limit; or equivalently, 
this sequence cannot converge to any point of A°. Therefore, if x is a 
limit of {x,,}, then 7 € A, and by (77), the sequence must have all 
but finitely many identical terms. Consequently, for each x € A, 
there is a sequence in A convergent to x. So, A = (a, b) contains all 
of its sequential limit points. 


However, A is not closed. To see this, take a in association with 
set A = (a,b). Let O, be any open set of the form R\ {any sequence 
not containing a}. Then O, is an open neighborhood of a. Any such 
neighborhood O, meets A, and thus a is an accumulation point of A 
(as any neighborhood of a contains at least one point of A other 
than a). Thus, A is not closed, for otherwise, by Problem 1.11, it 
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would contain all of its accumulation points. Now, a is thus a closure 
point, but not a sequential limit point of A apparently, because X 
fails to be first countable, according to Proposition 1.14. 0 


1.17 Remark. In conclusion, topological spaces fail to have the 
property the metric spaces do, namely, 


a € A < Jasequence in A whose limit is x. 


With no additional condition, we learned that this result was valid 
for metric spaces (see Theorem 3.3, Chapter 2). In other words, a set 
was closed if and only if it contained all of its limit points. O 


PROBLEMS 


1.1 Show that the collection 7 of sets introduced in Example 1.2 
(zv) is a topology in R. 


1.2 Let X be a nonempty set and 7 = {X,0,C° :C C X and C 
is finite}. Show that 7 is a topology on X. 7 is called the 
cofinite (or finite complement) topology on X. 


13 Let X = Rand let 7 = {X, @, (—c, 1], [1, 00), (3, 10]}. Isr a 
topology on R? If not, complement 7 by some subsets to a 
topology (and be reasonable). 


1.4 Prove Proposition 1.6. A C X is a neighborhood for all of its 
points if and only if A is open. 


1.5. Prove Proposition 1.7. (Hint: Show that A‘ contains all open 
sets that are contained in A and use Proposition 1.6.) 


1.6 Prove Proposition 1.9. The closure of a set A is the smallest 
closed set containing A. Furthermore, A is closed if and only 


ifA=A. 
1.7. Show that: 
(a) ACBS ACB. 
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1.8 
1.9 


1.10 


1.12 
1.13 


1.14 
1.15 


(b) AUB=AUB. 


(c) AMBC ANB; show that the inverse inclusion need 
not hold. 


(dd) ACBSACB. 


(e) nt(AN B)= ANB. 


(f) Is it true that Int(A) = A? 


Show that A = AUOA. 


Let X be an infinite set and let 7 be a cocountable topology 
introduced in Example 1.11 (i7) as 7: = {X,@,C°: C is at 
most countable}. Show that 7 is a topology on X. 


Show that A= A+0A (Hint: Proceed in the same way as in 
Problem 2.11, Chapter 2, and work with a neighborhood 
instead of a ball.) 


Prove that a subset of a topological space is closed if and only 
if it contains all of its accumulation points. 


Show that A= AU A’. 

Let 7 = { R, ( ~ L, i, [0, 3); {0}; {10} }. 

a) Extend 7 to the smallest topology 7 in R generated by 7. 

b) Let A= (-—7, —5], B=(0,7], and C =[- 4, 20). 
Find the sets 


A,B,C,4,B,é A’, B’,C',0A,0B,0C 


and determine whether A, B, and C are dense in R. 


Show that 0A = @ if and only if A is open and closed. 
Show that (A)° C A®. 
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1.16 


1.17 


1.18 


1.19 
1.20 


1.21 


1.22 


1.23 


1.24 


1.25 


1.26 
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Show that the inverse inclusion in the previous problem holds 
if and only if A is closed and open. 


This provides an equivalent definition of a closure point. Show 
that x € Aifand only ifVU, €U,, U,.N AFG. 


A subset of a topological space (X,7T) is a G5 set if it is a 
countable intersection of open sets. A subset F,, is defined as a 
countable union of closed sets. Show that G's can be represent- 
ed as a countable intersection of a monotone nonincreasing se- 
quence of open sets. Likewise, show that F', can be represented 
as a countable union of a monotone nondecreasing sequence of 
closed sets. 


Show that any neighborhood system at a point is / -stable. 


Let U/, be a neighborhood system at x. Show that for each 
U €U,, there isa V € U, such that U € U, forally € V. 


Let B, be a neighborhood base at x. Show that if A, B € B,, 
then there is a base neighborhood C' € 6, such _ that 
CCANB. 


A set O is open in (X,7) if and only if O contains base 
neighborhoods for all of its points. 


Show the validity of the following statement. A sequence {xp: 
n=1,2,...} C X converges to a point x € X if and only if 
for every base neighborhood B, € B, of x (where B, is an 
arbitrary neighborhood base), there is an N(U,)-tail of {ay}. 


Let {7; : 7 € I} be an indexed family of topologies on a non- 
empty set X. Show that (] 7; is a topology on X. 


iel 


Let S C P(X) be a collection of subsets of a nonempty set X. 
Show that there is a topology containing S. Also show that 
there is the weakest topology containing S, in notation 7(S). 


In the context of Problem 1.18 about Gs and F, sets, show that 
in (R”,7.), each Gs € 7S and each F, € T°. 
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2. BASES AND SUBBASES FOR TOPOLOGICAL 
SPACES 


In the previous section, we introduced the notion of a collection of 
open sets, called a topology. In many applications, describing an en- 
tire topology on a carrier is difficult and sometimes even impossible. 
This predicament is manageable if one deals instead with a smaller 
collection of sets, which is not a topology, but which generates a 
topology and thereby can be readily extended to a topology. With a 
similar idea, we have come to introduce neighborhood bases in place 
of more populous neighborhood systems for points. 


Take, for example, a metric space. Although the family of all 
open balls does not yield a topology, every open set, as we know, can 
be made of the union of some collection of open balls, and conse- 
quently, it leads to a topology and gives rise to the notion of a base 
for a topology. 


When specifying a base for a topology as its fingerprint, what is 
the connection between a base for the topology and neighborhood 
bases collected over all points? This question will also be answered 
in this section. 


2.1 Definition. Let (X,7) be a topological space. A sub- 
collection B of open sets is called a base for 7 if every open set is a 
union of some elements of B. (Specifically, it follows that @ must be 
an element of 6.) The elements of B are called base sets. O 


With no major difficulty (and with hints provided), the reader 
will be able to establish a very useful criterion of a base for 7, subject 
to Problem 2.2. An important relation between bases and neighbor- 
hood bases (see Definition 1.5(ii)) is given in the following theo- 
rem. 


2.2 Theorem. BG is a base for tT if and only if, @ € B and, for 
every point x € X, there is a neighborhood base B, consisting of 
open sets such that B, © B. 


Proof. We have to show that B is a base for 7 if and only if, for 
every x € X and each neighborhood U, of «x, there is a base 
neighborhood B,. € B such that B, C U,. 
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(2) Let B be a base for 7 and let U, be a neighborhood of a 
point z € X. Without loss of generality we assume that U,, is open. 
(Otherwise, take any open neighborhood O, C U, of x and work 
with O, instead.) If U, is open, there exists a subcollection of B 
whose union equals U,,. Thus, at least one set of this subcollection, 
say B, (€ B), must contain x, and B, C U,. Observe that by Defini- 
tion 1.5(ii), B, is then an element of a neighborhood base and 
B., = {B,} forms a neighborhood base of x. Therefore, each neigh- 
borhood base U/, of x has at least one neighborhood base B, of x 
such that 6, C B and each U, € U, is a superset of at least one 
B, € By. 


(22) Let B C7 and assume that for every x € X, there is a 
neighborhood base B, C B. Let O be an arbitrary open set. Then, by 
our assumption and by the definition of a neighborhood base, for any 
point x € O (because O is a neighborhood of x), there is a base 


neighborhood B, € B, such that B, C O. Thus O =|) B, (union 
xrEO 

of all such B, € B). Hence, every open set O € 7 can be formed of a 

union of some elements of 6 or, equivalently, Bisabase fort. O 


2.3 Examples. 


(i) Let {B,:z € X} be an arbitrary collection of open neigh- 

borhood bases at all points. Then, () B, can be regarded as an ex- 
zEx 

ample of a base for 7. Indeed, as in Theorem 2.2, take a point x from 
any open set O. Then, O is a neighborhood of x and thus it belongs 
to the neighborhood system at x. If B, € {B,:z € X} is a neigh- 
borhood base, then there is a base neighborhood B,, € B, included in 
O. Collecting all such neighborhoods of all points of O, we can re- 


present O as the union |) B,. Hence, {B.:z € X} is a base for the 
xEO 
topology T. 


(ii) As mentioned at the beginning of this section, in any met- 
ric space (X,d), the collection of all open balls is a trivial example 
of a base for the corresponding metrizable topological space. Indeed, 
by Definition 2.3, Chapter 2, for each open neighborhood O, of 
x € X, there exists an open ball B(x,¢) C O,. Earlier [in Example 
D1.5(227)], we showed that B(x, €) is a base neighborhood at x. Thus 
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by Theorem 2.2, the system {B(x,¢): « € X, € € (0, 00)} isa base 
for T(d). As in Example D1.5(#iz), a neighborhood base at x can be 
reduced to the system B, = {B(x,q): EQ, q> 0} of all balls 
with rational radii. Consequently, by Theorem 2.2, the collection of 
all open balls with rational radii is a base for T(d). (Note that these 
balls are centered at all x € X, so consequently, this base need not 
be countable.) 


(iii) We introduce an open rectangle in (R",72"). More formal- 
ism on this related to the product topology is follows in Section 5. A 
set 


OW x. x OM 


is called an open rectangle (or parallelepiped ) in R” if each O") is 
an open set in R. [See also Definition 5.4(vz), Chapter 1.] An open 
rectangle is referred to as simple or base if each O™ is an open 
bounded interval. Let R be the system of all base rectangles in 
(R", 72") along with the empty set @ and let B(x,r) be an open ball 
centered at some point x. Then, for each r, there is a hypercube C’, 
centered at x (which, in particular, is a base rectangle) that can be 
inscribed in B(x,r). Thus, the system C,, of all hypercubes at x can 


serve as a neighborhood base at x in 7’ and by (7), C=U C, isa 
xER” 
base for 7;’. 


(iv) The collection of all singletons {x} € P(X), along with Q, 
is a base for the discrete topology on X. O 


2.4 Theorem. Let 6, and By be bases for topologies 7, and T2 on 
X. Then the following statements hold true. 


(i) If By C 72, then % C 7». 
(it) If B, = Bo, then T1 = T9. 
(iit) If, CT and Bo C 1%, then T = To. 
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Proof. 


(i) (Observe that 6; need not be a base for 72.) Indeed, by the 
definition of a base, each O! € 7 can be represented as O! = U B;. 
7 


However, because B; € 72, it follows that UB; = O! € 7. 
1 


(iz) By part (2), 7, C 7) and 7) C 7, and thus 7, = 7). In other 
words, a base uniquely defines a topology. (Note that although one 
topology may have different bases, a base cannot share different to- 
pologies.) 


(iiz) First note that it does not follow that By is a base for 7). In 
fact, By need not even be a subcollection of 7,. However, if in 
addition, By C 7, then by part (2), T2 C 7, and therefore, 7, = 72. 
Indeed, Bz C 7, € 72 implies that 7, = 7». O 


When constructing a topology on a carrier, it is often beneficial 
to start with a collection, yet smaller and even more rudimentary than 
a base. With such a reduction the formation of product topologies 
and quick and tame continuity criteria of functions becomes even 
more rewarding. Recall that a function f, corresponding between 
two metric spaces X and Y, is continuous if and only if inverse 
images under f of open sets in Y are open in X. Remarkably, conti- 
nuity of f can be verified frequently on the basis of much less po- 
pulous families of so-called “subbase” sets in Y. This is established 
and elaborated in Section 4. We begin with the following. 


2.5 Definition. Let S C P(X) such that Ll) A= X. If there 
AcS 

exists the weakest topology on X containing S, then it is called the 

topology generated by S, and the collection S is called a subbase on 

X. (Note that S can directly restore only X, whereas 6 restores all 

open sets, including @. Clearly, a base 6 for a topology 7, besides 7 

itself, offers a trivial example of a subbase on X.) O 


We justify Definition 2.5 with the following. 


2.6 Proposition. The weakest topology on X generated by a 
subbase exists. 


(See Problem 1.25.) 
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2.7 Definition. As per Proposition 2.6, if S is a subbase on X, 
then the weakest topology generated by S is denoted by 7(S) and we 
call S the subbase for T(S). O 


If we have an arbitrary collection S of subsets of X, it seems 
plausible to expand S by supplementing it with X, @, as well as all 
unions and finite intersections of elements of S. Theorem 2.8 below 
refines this idea by showing that if S is a subbase, on the way to the 
construction of 7(S) we need to extend S with all finite intersections 
of its elements to arrive at a base for T(S). 


2.8 Theorem. Let S be an arbitrary subcollection of P(X) such 


that |) A= X,and let B be an extension of S that contains © and 
AcS 


all finite intersections of elements of S. Then B is a base for T(S). 


Proof. Let 7’ = { l\) BeBe B} wher B is defined in the 
BeB 
condition of this statement. We prove that 7’ is a topology on X. It is 
sufficient to show that r’ contains all finite intersections; the other 
properties of r’ as a topology are obvious. Also, for brevity in 
notation, we show this for the case of the intersection of just two 
“open” sets. Let U and V be two elements of 7’. By the definition of 
B, 
U=UU; and V = U V;,, (with U;, V; € B), 


iel jet 


where in turn 


U; = () Si and V; =) S!, (Si, 82 €S). 
— s=1 


k=1 s 


Then 
rr dy 


UnV=(U NSPA(U NS)= U Nst er. 


i€l k=1 jet s=1 relIUJ p=1 


Now, because by the construction of 7’, B is a base for 7’, and 
BCr(S) C7’, by Theorem 2.4(iii), identifying 7(S) as 7,7’ as 
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T2, and B as By, we have 7(S) = 7’. In particular, we see that B is a 
base for 7(S). O 


2.9 Remark. From Theorem 2.8, it may appear that a base for a 
topology must be 1-stable. Indeed, by the construction of the 
weakest topology r(S), we moved from S to B(S) including all 
finite intersections of subbase sets, which turned to a base of 7(S). 
Moreover, 6(S) was extended to 7(S) by the mere inclusion in 7(S) 
of all unions of base elements, which looks like the fundamental con- 
struction of a topology beginning with an almost arbitrary family of 
subsets. 


However, the truth is that, in general, an arbitrary base for a 
topology need not be ™-stable at all. Firstly, notice that the family 
of all open balls in a metrizable topology is a base, but it is not M - 
stable. In Example 2.9(7i7), the collection of all hypercubes in R" is 
a base for the usual topology 7’, but it obviously is not ™ -stable 
either. Secondly, from Problem 2.2, if 6 is a base for some 7, the 
intersection of two base elements if not empty must allow to hold as 
a subset, another base element, but it does not require that the inter- 
section of these two base sets be a base set. 


Now, the system C of all hypercubes can be extended to an O - 
stable family which can be easily guessed, brings us to the set R of 
all simple rectangles, which obviously becomes a larger base for 7,’ 
compared to C. Thus, 7’: = 7(C) = T(R) C 7, simply because 7’ 
is the weakest topology containing C and R, and CC RC 7’. On 
the other hand, because R is a base for 72’ and r(R), by Theorem 
2.4 (it) the topologies r(C) = T(R) and the usual topology 7!’ must 
coincide. Recall that metrics d. and the supremum metric are equiv- 
alent on IR”. No wonder that 7!’ and 7(C) coincide. 


In a nutshell, an idea that every base for a topology is / -stable is 
taken out of context from Theorem 2.8. O 


2.10 Examples. 


(i) In Example 2.3(iii) and Remark 2.9, we argued that the 
system C of all hypercubes and the system 7 of all simple rectangles 
are bases for (IR”, r:’), of which the second is M -stable. We can take 
for a base for 7(S) in R” the system of all simple rectangles with ra- 
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tional coordinates, which is certainly closed relative to all finite 
intersections and is also a base for 72’. Because this base is obviously 
countable, it thus follows that (IR", 72”) is second countable. 


(it) In another scenario of (R”,7'), the collection of open 
cylinders of types 


m*((a;, b;)) = R x «+» x R x (a,b) X Rx ... XR, 


where (a;, b;)'s are open bounded intervals in R, 7 = 1,...,, forms 
a subbase for 7;’. Now, the intersection of two cylinders is a cylinder 
or the empty set only within the cylinders {7*((a;, b;))} for the same 
fixed 7. Thus, the entire family of cylinders is not M -stable, but it 
can be extended to an ™ -stable base B by including in B the empty 
set © and all finite intersections of the subbase rectangles. Base 6 
evidently contains FR and is also a base for 7(R) and 7”. ial 


PROBLEMS 


2.1 Let (X,7) be a topological space and let B C 7. Show that B is 
a base for 7 if and only if for every open set O € 7 and each 
point x € O, there is a subset B of O such that x € Be B. 


2.2 Show that BC P(X) is a base for a topology on X if and 
only if two conditions are met: 


(i) each x € X belongs to at least one set B € B (or equiva- 
lently, X= |) B). 
BeB 
(iz) VB,,By€B and Va € Bi N By, ABEB such that 
tEBCB, NB». 


[Hint: Use the following steps. 
1) If B is a base, then apply Theorem 2.2(iz). 
Distre* L) Beye es By. Show that 7 is a topology 


BeB' 
on X and that B is a base for 7.] 
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2.3 


2.4 


2.5 


2.6 


2.7 


2.8 


Let B be a base for a topology 7 on X. In as much as B, in 
particular, is a subbase on X, it also generates the weakest 
topology 7(B) and hence 7(B) C tr. Is 7(B) = 7? 


Let B, and By be bases for the topologies (X,7,) and (X,72), 
respectively. Prove the equivalence of the following statments. 


(i) For each x € X and each base set A, € B, that contains 


zx, there is a base set B, € By that contains x and 
By © Ag, 


(it) 72 is finer than 7}. 


Let 7; denote the topology on the real line generated by all half- 
open intervals of type [a,b) where a,b € R. This topology is 
called the ower limit topology. Show that {[a, b):a,b € R} is 
a base for 7 and that 7, is strictly finer than 7,, the usual 
topology on the real line. 


Show that the collection of all sets on the real line of types 
(a, oo) and (—o«, b) is a subbase for the usual topology (R, 7.). 


Show that any base and subbase rectangles in Example 2.3(777) 
and Example 2.10(iz), respectively, are open sets. 


Show that the usual topology 7. on R” and the product 


topology Tp generated by x 7.” coincide. 
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3. HAUSDORFF TOPOLOGICAL SPACES 


We start this section with the construction of a rudimentary pro- 
duct topology that will be revisited in Section 5 and which 
presently plays a provisional role in connection with the unique- 
ness of limits in some topological spaces. The entire section is 
dedicated to the convergence of sequences, first introduced in 
Definition 1.10. We also acquaint the reader with one of the 
most common classes of topological spaces, the Hausdorff 
space, in which all convergent sequences have unique limits, a 
very useful feature we come to appreciate in metric spaces. 


3.1 Definition. Let (X),7;) and (X2,72) be two topological 
spaces. Define 


1 X%]={AxB:AEN, BET}. 


Then, obviously, (J U = Xj x X2 and, in addition, by Problem 
Uer XT2 

5.6(a) of Chapter 1, 7, x 72 is M-stable. Therefore, by Theorem 2.8, 

we can form a topology 7(7, Xx 72) generated by 7 x 72 (as a base), 

denote it by Tp, and call it the product topology. (We discuss more 

general variants of Tp in Section 5.) O 


We continue with a useful lemma needed in the sequel. 


3.2 Lemma. 4 sequence {&, = (Ln, Yn)} GC X1 X Xo is conver- 
gent to a point € = (x,y) if and only if (marginal sequences (| {x} 
and {y,,} converge to x and y, respectively. 


Proof. (i) Let €,, — € in rp. Then the marginal sequences {z,,} 
and {y,,} converge to x and y, respectively. Indeed, let O, and O, be 
open neighborhoods of « and y. Then, O,, x O, is a base set in Tp 
and an open neighborhood of €. Because {€,,} is convergent, there is 
an N(O, x O,)-tail of {€,}. Hence, marginal tails {ay,vy4i,...} 
and {yn,yw+i,--.} are subsets of O, and O,, respectively. 
Therefore, x, — x and y, — y. 
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(it) Conversely, let {x,,} and {y,} be two sequences from X1 
and X» that converge to x and y, respectively, and let We be an open 
neighborhood of € = (x, y). Then, by the same arguments as in part 
(7), there is an open subset (as a base set) O; x Oy C We such that 
€ € Oj X Oo; that is, O; and O2 are open neighborhoods of x and y. 
Thus, there is an M(O})-tail of {x,} and N(O2)-tail of {y,,} imply- 
ing that the R(O, x O2)-tail (where R = max{M, N}) is a subset 
of O7; x Oo and thus of We. O 


Now we prove a simple but important statement that if 7, and 7) 
are first countable, then so is Tp. 


3.3 Proposition. Let (X,,7)) and (X»2,7 2) be first countable 
topological spaces. Then (X, X X2,Tp) is also a first countable 
topological space. 


Proof. Let € = (x,y) € X; x X». Because X, and Xp are first 
countable, there are countable neighborhood bases B, = {Uj : j = 
1,2,...} and By = {V*:k=1,2,...} of x and y, respectively. 
Define B; = {Uj x ay : j,k =1,2,...} and show that it is a 
neighborhood base for €. Without loss of generality and to simplify 
the proof we assume that all neighborhoods involved are open. 


Now, let We be an open neighborhood of €. Then, since 7; X 72 
is a base for Tp, We can be representedasWe= U (A x B). 


some AET,BET2 
Hence, there is an A x B C W¢ such that € € A x Bora € A and 
y € B. Because B, and 6, are neighborhood bases for x and y and 
A and B are open neighborhoods of x and y, there are U/ and Vv; 
that are subsets of A and B. Consequently, Uj x V;'C Ax B 
C We. Thus, Be is indeed a neighborhood base for € and it is 
countable. O 


In various applications encountered in topological spaces we deal 
with (including those in forthcoming Chapters 5 and 6), every con- 
vergent sequence has exactly one limit. An important representative 
of this class is introduced in the definition below. 
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3.4 Definition. A topological space (X,7T) is said to be 
Hausdorff (or separated or T>) if every two distinct points, 
x,y € X, possess disjoint neighborhoods. O 


T> is a designation of the second separation axiom. (The charac- 
ter 7’ stands for the German word Trennungsaxiom.) Other separa- 
tion axioms are introduced and discussed in [Dsh2]. As mentioned, 
the next proposition (which is hardly a challenge for the reader) is a 
consequence of Hausdorff spaces. 


In Example 1.11(2) we saw that a sequence in general can have 
more than one limit (speaking conservatively). The “Hausdorfness” 
of a topological space is a noteworthy and often used restriction 
emulating metric spaces. 


3.5 Proposition. Let (X,7) be a Hausdorff topological space, 
and let itp on. = &, aid lity se tn = y. Then c= y. O 


(See Problem 3.1.) 


3.6 Example. Let (X,d) be a metric space and let (X,7(d)) be 
the corresponding metrizable topological space. With x; and x2 
being distinct points of X, we pick two open balls, B(a1,7r) and 
B(x2,7r), with r= za(x1, x2). It follows that the balls are neighbor- 
hoods of x; and x2, respectively, and that B(x,,r) N B(a2,r) = @. 
This immediately implies that any metrizable topological space is 
Hausdorff. O 


3.7 Remark. We want to raise an important question: when can 
Proposition 3.5 be reversed, that is, when does the uniqueness of 
limits imply that the space (X,7) is Hausdorff? In Example 1.16(77) 
we argued that in a cocountable topological space, the only se- 
quences with at most finitely many distinct terms can have limits and 
they are unique, yet this space is not Hausdorff. Indeed, any two open 
neighborhoods O,, and O, of any two points x and y are not disjoint, 
because they differ only upon at most countable sets. So, what will 
then a reasonable sufficient condition be that can make a topological 
space with the “uniqueness of limits property” Hausdorff? O 


We begin with the lemma below that builds promised bridges 
between uniqueness of limits of sequences and Hausdorffness of 
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spaces. The same result is generalized and applied to filters and nets 
in [Dsh2]. 


3.8 Lemma. Let (X,7) be a topological space. Then (X,T) is 
Hausdorff if and only if the diagonal D = {(x,x) € X*} is closed in 
the product topology (X°, Tp). 


Proof. 


(i) Let (X,7) be Hausdorff and let € = (x,y) € D°. Thus, 
x #y and due to the “Hausdorfness” of X, x and y have disjoint 
open neighborhoods O, and O,. In as much as they are disjoint, 
obviously O, Oy, is disjoint with D. Furthermore, because 
O, 1 Oy is a base set of Tp, it is open and thus € is an interior point 
of D© = D 1s closed. 


(it) Conversely, if D° is open, it can be represented as a union 

of base open sets, that is, as D> = |) Ax B. Then given a 
some A,Ber 

point (x,y) € D*, at least one of these sets A x B, say O, x Oy C 
D°, must contain (z,y). Thus, 2 € O, and y € O, and, conse- 
quently, O,, and O, are open neighborhoods of x and y, respectively. 
They are disjoint, because O, x O, C D°. Hence, (X,7) is 
Hausdorff. O 


Now we are ready for the main result of this section. 


3.9 Theorem. Let (X,7) be a first countable topological space 
in which limits of all convergent sequences are unique. Then, (X,T) 
is Hausdorff. 


Proof. Let D be the diagonal in the product topology Tp and let 
€ € D. By Proposition 3.3, tp is first countable. Therefore, by 
Proposition 1.14, there must be a sequence {(%n, 2) } C D such that 
(2n, Ln) > € = (x,y). By Lemma 3.2, the marginal sequence {z,,} 
must converge to x and y. However, since by our assumption, all 
convergent sequences in X must have unique limits, x = y and 
hence € € D implying that D C D and thus D is closed. By Lemma 
3.8, (X, 7) is Hausdorff. O 
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PROBLEMS 


3.1 


3.2 


3.3 


3.4 


3.5 


3.6 


ae) 


3.8 


Prove Proposition 3.5. Let (X,7) be a Hausdorff topological 
Space. Nitty se ty =o. and lee Nitin sg o, = 9: Then = 9; 


Show that any one-point set in a Hausdorff space is closed. 
Show that any metric space is first countable. 
Prove that any separable metric space is second countable. 


Is it true that any first countable topological space is also 
second countable? 


Prove that if a topological space is second countable, then it is 
separable and first countable. 


Show that subspaces of second countable spaces are second 
countable. 


Prove that the usual topology 7’ on R” and the product 


topology Tp on R” generated by x 7h) coincide, where 760 = 


Te 1s the usual topology on R. 
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4. CONTINUITY IN TOPOLOGICAL SPACES 


Except for a brief introduction of sequences (being restricted ver- 
sions of functions) in Section 3, in the present section functions 
appear for the first time in conjunction with topologies. Naturally, 
their most significant quality that we look into is continuity. 


After a first acquaintance with continuity in metric spaces 
(Section 4, Chapter 2), the reader is now well prepared to its sur- 
prising generalization for topological spaces and a striking similarity 
between Theorem 4.2 below and Theorem 4.3, Chapter 2, with 
respect to a key continuity criterion. Again, we observe some other 
continuity properties, typical for metric spaces and holding for 
special topological spaces, yet more general than those in metric 
spaces. One of them deals with an important relationship between 
convergence of sequences and continuity of functions initiated in 
Chapter 2 (earlier formulated as Theorem 4.4 and pledged to be 
proved in this section). 


4.1 Definitions. 


(i) A function f:(X,7) — (Y,71) is said to be continuous at a 
point a € X if, for every neighborhood Ws ,,), there is a neighbor- 
hood U;, such that f.(Ua) © Wea). 


This is equivalent to the following definition. f is continuous at 
a, if for every neighborhood Wy q) of f(a), f*(Wya)) is a neigh- 
borhood of a (see Problem 4.1). 


(it) The function f is said to be continuous on X (or simply 
continuous) if it is continuous at each point a € X. O 


4.2 Theorem. Let f: (X,7) — (Y,7™) be a function. Then the 
following are equivalent. 


(i) f is continuous. 


(ii) The inverse image under f of any open set H © 7, is open, 
that is, is an element of T. 
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Proof. 

(i) => (i). Let H € 7. For each point a € f*(A), f(a) € H 
and therefore f(a) is an interior point of H. Specifically, H is a 
neighborhood of f(a). Because f is continuous at a, there is a 
neighborhood U, such that f(U,) C H. Because the inclusion is 
preserved under the inverse, we have 


Ua S f*(f(Ua)) S f*(A); 


which implies that f *() contains a neighborhood for each of its 
points. Hence, f*(H) is itself a neighborhood for all of its points. 
Therefore, by Proposition 1.6, f*(#) is open, i.e. is an element of T. 


(ii) = (2). Leta € X and let W5(,) be a neighborhood of f(a). 


Then, there exists an open set H € 7, such that f(a) © H C Wyiq). 


By assumption (ii), f*(H) an element of 7. Because obviously a € 
f*(A), f*(4) is a neighborhood of a and thus f*(Wy(q)) is also a 
neighborhood of a. Consequently, we have continuity of f at a. O 


Let (X,7) be a topological space. Recall that 7° denotes the 
collection of all closed sets. 


4.3 Proposition. A function f : (X,T) — (Y,7) is continuous 
on X if and only if the inverse image under f of any closed set 
O° € 7{ is closed in (X,7). O 

(See Problem 4.2.) 

4.4 Proposition. Let (X,7), (Y,7), and (Z, 72) be topological 
spaces and let f: X — Y andg:Y — Z be continuous functions. 
Then the function go f :X — Z is continuous. 0 


(See Problem 4.3.) 
4.5 Definition. Let (X,7) be a topological space and let 
[X,Y , f] be a function. Define 


To={B CY: f*(B) € 7c}, 
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ie., f**(Tg) CT. By the following arguments (Remarks 4.6), Tg is a 
topology and it contains any topology relative to which f is conti- 
nuous. Tg is called the quotient topology induced on Y by f. O 


[Recall that f* is defined on P(X); consequently, f** is a 
function acting on P(P(X)). See Definition 2.1(vii) and (vii), 
Chapter 1.] 


4.6 Remarks. 
(i) Tg is indeed a topology: 


1) O,Ye TQ. 
2) Bip vinyDe eto = 7 (Be) =f F Be) ee Gs 
k=1 hol 


n 
the intersection of open sets) > {) By € TQ. 
k=1 
3) A similar consideration can be used to show that Tg 
contains all unions. 


(ti) Tg is the largest topology on Y relative to which f is 
continuous. This follows directly from Definition 4.5. O 


4.7 Example. Let X = R, + = {R, @, (— 1,2], [0,3), [0, 2], 
(— 1,3), (—1,1)} and let f(z) =? defined as f:R — R=Y. 
It is clear that R, @ and [0,1) are the only subsets of Y whose 
inverse images are in T. Therefore, {R,@, [0,1)} is the quotient 
topology on Y. 


By Theorem 4.2, f: (X,7) — (Y,7’) is continuous if and only 
if f**(7') Cr. However, if we know a generator S of 7’, then 
condition (zi) of Theorem 4.2 can be relaxed as per the following 
theorem. 


4.8 Theorem. Let f: (X,7) — (Y,7(S)) [where r(S) is the 
topology generated by a subbase S]. Then f is continuous if and 
oni if { "(S) Cr, 

Proof. If f is continuous, then, in particular, f**(S) Cr. 
Assume that f**(S) C 7 and introduce the quotient topology tg 
induced by f. Thus, S C tg, which implies that 7(S) C 79, for T(S) 


206 CHAPTER 3. ELEMENTS OF POINT-SET TOPOLOGY 


is the smallest topology containing S. Then in as much as f **(7Q) 
C 7, we have 


f*(7(S)) SF (tTQ) ST. O 


4.9 Theorem. Let f: (X,7) — (Y,7’) be a map continuous at 
some point x« © X. If {i+ is a sequence convergent to x, the 
sequence { f(x,)} is convergent to f (x). O 


(See Problem 4.10.) 


4.10 Remark. Let (X),7;) and (X2,72) be two topological 
spaces and Tp be the product topology on X?. Obviously both pro- 
jection maps 7, and 7 are continuous, because 7} (O1) = O; x X2 

€ T, X T2 C Tp. So, m7* (71) C Tp. Thus, 7 is continuous and so is 

mT. It offers an alternative argument to part (7) in Lemma 3.2 that a 
sequence {&, = (2n,Yn)} CX, x Xq is convergent to a point 
€ = (x,y), then marginal sequences {z,,} and {y,} converge to x 
and y, respectively. 


Indeed, by Theorem 4.9, the marginal sequences {71 (&,) = rn} 
and {72(En) = yn} converge. More specifically, if (an, yn) — (2, y), 
then x, — x and y, — y. a 


Theorems 4.8 and 4.9 and the next theorem form an analogue to 
Heine's Theorem 4.4, Chapter 2, which was valid for metric spaces. 
The statement in Theorem 4.9 has no restriction as to the nature of 
topological spaces (X,7) and (Y,7’), although its converse needs to 
be constrained by the condition that (X, 7) is first countable. 


4.11 Theorem. Let f:(X,7) — (Y,7') bea map and let (X,T) 
be first countable. If for any sequence {x;,} convergent to a point 
x € X, the sequence {f(x,,)} converges to f(x), then f is conti- 
nuous at x. 


Proof. To prove this theorem, we assume that f is not con- 
tinuous at x; then select a sequence {2,,} convergent to x such that 
{f(an)} does not converge to f(x). The assumption that (X,7) is 
first countable is essential in the selection of a convergent sequence 
{x,}, which otherwise need not exist. If f is not continuous at x, 
there is a neighborhood W‘;,.. such that f *(W 7) ) is not a neighbor- 
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hood of x, or equivalently, there is no neighborhood U, such that 
f (Ux) © Wyo). (Otherwise, if f(Uz) C Wr), then 


Use Fe) SPW). 


This would contradict our assumption. (See Figure 4.1.)] 


Figure 4.1 


Specifically, it follows that, for each base neighborhood B € B,, 
f.(B) is not a subset of W5(,). Because (X,7) is first countable, 
there is a countable neighborhood base B, = {Bj, Bz,...} which 
can always be assumed to be monotone nonincreasing. Now, each B; 
contains at least one point, say x;, such that f(x;) € We), which 
immediately yields that the sequence {f(z,)} is not in W,.,) and, 
thus, does not converge to f(x). However, by Proposition 1.13, 
In > @. O 


Theorem 4.11 leads to some useful applications. Prior to that we 
need the following. 


4.12 Definition. Let f,g: X — Y be two functions. Define 
f@g:X? >Y’ as (f @g)(x1, £2) = (f (21), g(x2)). Consequent- 
ly, we have [P(X?), P(Y), (f ® g)s] specified as 


(f ® g)«(A) = {(f (#1), 9(@2)) : (1,22) € A}, AC X? 
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and [P(Y*), P(X’), (f @ g)*] as 
(f @ 9)*(B) = {(f*{y}, o*{y2}) : (41,42) € B},B CY’. 


In particular, it is readily seen that 


(f @ g)x(Ai x Ag) = fa(Ai) x gx (Az) (4.12) 

and 
(f @ g)* (By x By) = f* (By) x g‘ (By). (4.12a) 
O 


4.13 Lemma. Let f,g: (X,T) — (Y,7’) be two continuous 
maps. If (Y,7') is Hausdorff, then the set S={wExX: 
f(x) = g(x)} is closed in (X,T). 


Proof. Because f and g are continuous, the map f ®g: X? > 
Y? is continuous relative to the respective product topologies. 
Indeed, by Theorem 4.8, f ®g is continuous if and only if 
(f @g)*(7’ x 7') Crp, where Tp is the product topology on X?. 
Since f*(B,) and g*(B2) are open in X, when B, and By are open 
in Y, it follows from (4.12a) that (f @ g)*(B, x Bo) € 7 x TC Tp. 


In as much as by the assumption (Y, 7’) is Hausdorff, by Lemma 
3.8, the diagonal D in Y x Y is closed. Hence, the set S, as the 
inverse image of the diagonal D under the continuous map f ® g 
must be closed, as per Proposition 4.3. O 


4.14 Proposition. Let f,g: (X,7) — (Y,7’) be two continuous 
maps that coincide on some dense set in X. If (X,7) is first 
countable and if (Y ,7') is Hausdorff, then f = gon X. O 


Thus, it follows that a continuous function is well-defined on a 
dense set. The proof of this proposition is the subject of Problem 
4.11. 


4.15 Example. If f,g: (R",7-) — (R",7-) are continuous 
maps that coincide on the set Q” of all vectors with rational 
coordinates, then f and g are identical on R”. This fact takes into 
account that (IR", 7.) is Hausdorff and first countable. CO 
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4.16 Definition. Let (X,7) and (Y,7') be two topological 
spaces. A bijective map [X,Y, f] is called a homeomorphism if both 
f and f~' are continuous. The topological spaces (X,7) and (Y,7’) 
are then called homeomorphic. We write X ~ Y. If f fails to be sur- 
jective, then f is called an embedding of X into Y. X is also said to 
be embedded in Y by f. Cl 


4.17 Remark. It is not difficult to see that the homeomorphic 
property applied to a family of topological spaces between any fixed 
carriers X and Y defines an equivalence relation (show it; Problem 
4.12). O 


PROBLEMS 


4.1 Show that f is continuous at a point a if and only if for every 
neighborhood Wy, , f *(W4(a)) is a neighborhood of a. 


4.2 Prove Proposition 4.3. A function f:(X,T) — (Y,7) is 
continuous on X if and only if the inverse image under f of 
any closed set O° € Tf is closed in (X,T). 


4.3 Prove Proposition 4.4. Let (X,7T), (Y,m), and (Z,7 2) be 
topological spaces and let f: X — Y and g:Y —Z be 
continuous functions. Then the function gof:X —Z is 
continuous. 


4.4 Let f: (X,7) — (Y,7) be a function such that f(x) = x, 
X=Y=KE, f=—]1{B,O, {1}, [1,3)}, and 7 — {B, 6,12}, 
[2,4)}. Is f continuous? 


lament 


4.5 Under the conditions of Problem 4.4, set f(z) =x+1. Is f 
continuous? 


4.6 Let f: (X,7) — (Y,7’) bea map. Show that f is continuous 
at a point x € X if and only if, for any base neighborhood 
Bra) of the point f(x), f *(Byc,)) isa neighborhood of «. 
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4.7 Under the condition of Problem 4.6, assume that r’ = r(d), 
that is, (Y,7’) is a metrizable topological space. 


a) Show that f is continuous at x € X if and only if the 
inverse image under f of any open ball Ba(f(x),¢) is a 
neighborhood of x. 


b) Show that, for each open ball Ba(f(a),¢) there is a 
neighborhood U,,(€) such that 


f(Ux(e)) © Ba(f(z),€). 


4.8 Let f: (X,7) — (Y,||- ||) bea map, where Y is an NVS 
over a field F. Show that f is continuous at x € X if and only 
if, for every « > 0, there is a neighborhood U,(¢) € U, such 
that for each y € U,(e), || f(x) — f(y) || <e. 


4.9 Prove the following statement. Let f: (X,7) — (R”, || - |l,), 
where (X,T) is a topological space. Then f is continuous at a 
point x € X if and only if, for every € > 0, there is a neigh- 
borhood U,(e) €U, such that, for all points y € U;(e), 
Ifa) - fll. <e. 


4.10 Prove Theorem 4.9. Let f: (X,7)—(Y,7') be a map 
continuous at some point x € X. If {x,} is a sequence 
convergent to x, the sequence { f (x,,)} is convergent to f(x). 


4.11 Prove Proposition 4.14. Let f,g: (X,T) — (Y,7’) be two 
continuous maps that coincide on some dense set in X. If 
(X,7) is first countable and if (Y,r') is Hausdorff, then 
f=gonX. 


4.12 Prove the statement posed in Remark 4.17. Show that the 
homeomorphic property applied to a family of topological 
spaces between any fixed carriers X and Y defines an equiv- 
alence relation. 
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4.13 Show that (IR, 7.) is homeomorphic to ( — 1,1) with the cor- 
responding relative topology on ( — 1, 1). 


4.14 Is (R,7-) homeomorphic to [ — 1, 1] ? 


212 CHAPTER 3. ELEMENTS OF POINT-SET TOPOLOGY 
NEW TERMS: 


function continuous at a point, 203 

function continuous at a point, criterion of, 203 
continuous function, 203 

continuity of a function, criterion of, 203, 204, 205, 206 
composition of continuous functions, 204 
quotient topology, 205 

continuity of projection maps, 206 

continuous function on a dense set, 208, 210 
homeomorphism, 209 

homeomorphic topological spaces, 209 
embedding, 209 

embedded set, 209 
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5. PRODUCT TOPOLOGY 


Let (Yi,71),---; (Yn; 7) be “factor” topological spaces. One of the 
reasonable ways to define a topology on the Cartesian product 
Q)= Y, x --- x Y,, (earlier introduced in Section 3 for two spaces) is 
to take the collection 


for a family of open rectangles. Because B is M -stable we use it as a 
base for the topology it generates. Indeed, by Theorem 2.8, GB is a 
base for 7(B) that includes all unions of elements of B. We call 7(B) 
the product topology on Q and denote it by Tp. 


The proposition below allows a significant reduction of the base 
b= x 7; for Tp provided we have the knowledge of bases for the 


respective factor topologies. 


5.1 Proposition. Let B= x B; = By x---x By, 


= {Tle : B; € Bi, Redneath, 


i=l 

where B; is a base for ith factor topology T;, 1 = 1,...,n. Then B is 

also a base for Tp. O 
The reader can readily handle the proof in Problem 5.8. 
Any element of B is called a simple or base rectangle. 


The construction below is an effort to reduce further a generator 
for the product topology on finitely many spaces. 


5.2 Proposition. Let S = x S; defined as 
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= {Ts Seo, 75 Levan}, 
i=l 
where S; is a subbase for T; = T(S;), t= 1,...,n. Then, S is a sub- 
base for Tp, that is, T(S) = Tp. O 


Notice that the base for tp of Proposition 5.1 involved in the 
construction of Tp contains finite intersections of subbase elements. 


Any element of S in Proposition 5.2 is called a subbase rect- 
angle. (The proof of Proposition 5.2 is assigned to Problem 5.9.) 


5.3 Proposition. Let S’ be the set of all unit subbase cylinders 


a= LU *(S;) -- {a*(S;): Se 6,7 = Lpieag hs 


i=1 
where S; is a subbase for 7;. Then S' is a subbase for Tp = T(S). O 
(See Problem 5.10.) 


5.4 Example. According to Problem 3.8, the usual topology 7:’ 
on R” and the product topology tp on R” generated by x re) 
coincide, where 7) = Te is the usual topology on R. By Proposition 
5.1, the usual topology 7’ on R” is thus generated by simple rect- 


angles of the form [| (a;,b;), where (a;,b;) C R are base elements 


i=1 


of (IR, 7-). In particular, the system of all rational rectangles is also a 
base for Tp = 7’. According to Proposition 5.3, the system S’ of all 
unit cylinders {7*((a;,};)): (a;,b;) CR, i=1,...,n} is a subbase 
for 72’. [See also Example 2.10(22).] Once they intersect, for different 
indices 2, they form simple cylinders (for some 7's) or rectangles if 


the intersection is of the type (| 7*((a;, b;)) whose family becomes a 
base for 7/’. Therefore, we can say that the usual topology on R” is 
generated by the set of all simple cylinders. O 


From Remark 4.10 we learned that the projection maps are 
continuous relative to the product topology. Furthermore, we have 
the following. 
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5.5 Theorem. Let 2 = [| Y; and 7; :Q — Y; be the ith projec- 
it 


tion map, 1 =1,...,n. Then, the product topology Tp on Q) is the 
weakest topology with respect to which each projection is continu- 
OUS. 


Proof. Let 7 be a topology on (2, relative to which each pro- 


ri a ri Ft - nm 
jection is continuous, that is, suppose 7**(7;) C 7. Then, B = x T; 
= 


Cr. Indeed, if O€ X74 then O =) a7(O;) (where O; € 7;) 
= i=1 

with 7*(O;) € 7, and because 7 is M -stable, O € T. Finally, because 

B is a base for Tp, by Theorem 2.4 (7), Tp CT. O 


5.6 Remark. Now, we extend the notion of the product topology 
of finitely many factor spaces to the product topology on the Carte- 
sian product of arbitrary many factor spaces. We therefore assume 
that {(Y:, 7): « © X} is an arbitrary indexed family of topological 
spaces. Let us consider two different models of product topologies 
on the Cartesian product 2 = [| Y,. One of them, called the box to- 

vExX 
pology (in notation 7g), is similar to that for finitely many factor 
spaces and is subject to the following construction. As a base for Tz, 
we take the system of box rectangles, 


Bu = { 10s: 0,€ eh, (5.6) 


vExX 


or even a weaker base, 


Bo={ 1B. Be B.S (5.6a) 


rEX 


Hence, this way the formed box topology Tg is not different from its 
version for finitely many factor spaces and it likewise preserves the 
continuity of projection maps. The box topology was introduced by 
the Austrian mathematician Heinrich Tietze in his 1923 paper “Bei- 
trage zur allgemeinen Topologie I” (see [Tiet]). 
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There is another, “more economical” topology on (, which also 
preserves continuity of projection maps, and in addition, leads to a 
tamer formation of the widely used “pointwise topology” (which the 
box topology does not as we show it). 


5.7 Definition. Let us define the topology tp on {2 through the 
base 


B= wil I] 0.) : A's are finite subsets of XY, O, € rah 
rEA 
(5.7) 
where 


pies es nal 
rEX acA 

is the A-projection map, according to Definition 5.4 (v), Chapter 1. 
In other words, 


ce { [] Oz: Oz € re (5.7a) 
rex 

where O, = Y,, except for finitely many indices x € X. Thus, all 

elements of B are simple cylinders (see Definition 5.4 (iii), Chapter 


1). The topology Tp (or in notation, rather ( pe »)) generated 
rex 

by such a base is called the product topology or Tychonov topology 

on 2. We notice that although the box topology is also a product 

topology on 92, the Tychonov topology is most often referred to as 

the product topology. O 


The product topology in Definition 5.7 was first introduced by 
the Russian mathematician Andrei Tychonov in some rudimentary 
form, in 1930. He generalized this concept to its present form in his 
later article, “Ein Fixpunktsatz” (see [Tych]). The same formalism of 
the product topology was also due to the Czech mathematician 
Eduard Cech (1893-1960) in 1937 who seemed to be unaware of 
Tychonov's papers. 


Obviously, base B of (5.7 -5.7a) for Tp can be further reduced if 
each O,, is selected from a base 6, for 7,. (See Problem 5.15.) 
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5.8 Remarks. 


(i) Let S, be a subbase for 7,,. One can show that the collection 
S = {*(S,): S, € S,, x € X} of unit subbase cylinders is a sub- 
base for Tp, just as it took place in the case of finite products. (See 
Problem 5.16.) 


(ii) We always prefer to deal with the smallest possible base or 
a subbase for Tp, provided that we have the foreknowledge of 7;,'s, 
such as their bases or subbases. For instance, as a rule of thumb, we 
can take {7%(O,): Oz € T,} as a subbase for Tp, unless more is 
known about the nature of 7,,'s and thus the subbase will be further 
reducible. i 


5.9 Examples. 


(i) Let {(¥:,7,): a € X} be a collection of metrizable topo- 
logical spaces and let (2. = [| Y,. According to Example 2.3 (7), the 
rExX 
collection of all open balls B,(y2,r), ye € Yr, forms a base for 
(Y;, Tr(d,)). Now, the set of all simple cylinders of the type 


k 
Thoy.-.s0} (112, (Yi, r)) 


= Te, (Bri (M, r1)) M Te, (Bro (yo, cl) ae M Te, (Bry (Yes Tk)), 
a,EX, YE Pigs oo ane 
(5.9) 


forms a base for Tp, whereas the collection of all unit cylinders of 
the type 7° (Bn, (yx, 7x) ) must form a subbase for Tp. 


(ii) Let Q = R® be the collection of all real-valued functions on 
R regarded as the Cartesian product of R's, with each R endowed 
with the usual topology. We select an open neighborhood U; of a 
point f € Q. First of all, according to (5.9), a simple cylinder with 
base (yi — €1,y1 + €1) X--- X (YR — Ek, YR + Ex) has the form 


k 
Te, ou) (11 eee “)) 


i=1 
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= 7, (Mi — ery tei)| ON... Ams, [Ye — Ex, YR + €x)], (5.9a) 


where y,, is a point in Y, = R. In order that this cylinder becomes a 
neighborhood of f, we need to replace y, with the corresponding 
traces f(a.) of f in the factor spaces Y,,..., Yo,: 


k 
Titers ah GG (0%) — €;, f(a4) + <i). (5.9b) 


Figure 5.1 


O 


5.10 Remark. Let {g, : x € X} be a family of functions g, : 2 
— Y;, where each Y,, is endowed with a topology 7,. Recall that 
g.(Tr), Va € X, is a topology on 2, and that each function g, is 
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continuous relative to this topology. The union of all these topol- 
ogies, 


S= U Gs (Tx); 


rEX 


need not be a topology, for it does not necessarily preserve unions 
and intersections. But we can extend it to a topology, say 7(S), re- 
garding S as a subbase. This topology is the weakest one relative to 
which all functions of the above family are continuous. Consequent- 
ly, T(S) is called the weak topology generated by the family {g.;}. 


Now, taking I Y, for Q and 7, (the xth projection map) for g,, 
LE 


by Problem 5.19, the Tychonov topology Tp is the weakest topology 
relative to which all projection maps are continuous. 


Furthermore, by Problem 5.16, U. m**(S,) is a subbase for Tp. 
LE 
Other relevant subbases for rp that can be used are U i, (Gs) Or 
rE 
L) 7**(7,). In a nutshell, 7p turns out to be the weak topology gene- 
oi by the projection maps. Consequently, this offers an alternative 
definition of the Tychonov topology on (2 = I Vins O 


5.11 Example. Recall that a sequence {y,,} C 2 converges to a 
point y € 2 if, for every neighborhood U,,, there is an N(U,,)-tail of 
{yn}. Under the conditions of Example 5.9(i2), if Q is the product 
space 2 = R®, a sequence of points {f,,} C Q is convergent to a 
point f € 2 ifand only if f,,(a) — f(x) for each x € R. To see this 
we note that a base neighborhood U; of f in (5.9b) is of the form, 


Ug=4g 6 0): |F (a7) = gl)| < ep = 102.4F}s GAT) 


which is just another expression of (5.9b). Therefore, f,, — f if it is 
close to f on each finite set {2),...,7;,} CX =R, specifically on 
singletons {x} C R. O 


Example 5.11 motivates the following notion. 
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5.12 Definition. Let {(Y;,7,) : « € X}, be an indexed family of 


topological space and let { |] Y,,7p} be the Tychonov product 
rex 
topology. Recall (Remark 5.2 (ii), Chapter 1) that if Y, = Y and 
Tx =T, for each x € X, then we denoted TT Y, by Y* and called it 
LE 


the set of functions from X to Y. Now, in this particular case of the 
Tychonov product topology, in light of Example 5.11, (Y*,7p) is 
called the topology of pointwise convergence. O 


As a generalization of Example 5.11, the following proposition 
can help solidify our understanding of the topology of pointwise con- 
vergence. 


5.13 Theorem. Let {f,,} be a sequence in Y*. Then fy, — 
f < Y% (in the topology of pointwise convergence) if and only if 
fr(x) — f(x), Va € X [in the topology (Yx, Tx)]. 


Proof. Recall that 7,: Y* — Y,=Y is the x-projection map 
defined as 7,,(f) = f(a) [see Definition 5.4 (i), Chapter 1]. 


(i) First assume that f, — f in (Y*,r7p). By Problem 5.19, 7, 
is continuous for every x. Thus, by Theorem 4.9, 7,( fn) — T2(f). 
This yields that f,,(2) — f(a) in (Y2, Tr). 


(it) Now assume that f,(x) — f(x) in (Y;,7,), Va € X. We 
need to show that for any base neighborhood 6; of f, there is an 
N(B,)-tail of {f,,}. Clearly, By is a simple cylinder of the form 


k 
By — I tt (Ir Or) ’ 


where Of(,,,) contains f(x;). 


By our assumption, f;,(z;) — f(x;), which implies the existence 
of an N;(Of,,))-tail, i=1,...,k. Let N = max{Nj,..., Ng}. 
(Note that this is exactly the place where we take advantage of the 
Tychonov product topology, for otherwise, in the case of the box 
topology, a base neighborhood of f could not be represented by a 
simple cylinder. The latter is an obstacle to finding a finite maximum 
of infinitely many N;'s implying that {f,,} does not converge to f in 
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this box topology.) Then, for each x;, 7 =1,...,k, we have the 
N(Oj(q,))-tail of {f,(vi)}, which yields that 


fn € mC inl@d}) S 73, pa))s = 1... Ke 


Therefore, we have 


k k 
fn © 1) 75, fn(@a)) S (1) 2% (Oza,)) = By, foralln > N. 
i=l i=l 


The latter yields the existence of the N(By)-tail of {f,}, and 


therefore, f, — f in (Y*,7p). O 
PROBLEMS 
5.1 Let (Y,,7™) and (Y, 72) be topological spaces, let (0, Tp) be 


5.2 


5.3 


5.4 


3.0 


the product space, and let F; € 7/,7 = 1,2. Show that F, x F 
ate. 


Let (Y;,7;),2 =1,...,n, be topological spaces, (Q, Tp) be the 

product space, and let F; € 7/,i =1,...,n. Show that |] F; 
i=l 

E Tp. 


Show that Int(A x B) = Ax B. 


Let (Y;,7;),2 =1,...,n, be topological spaces and (Q, Tp) be 
the product space. Show that int( A) = ILA: 

i=l i=l 
Let (Yi, 71) and (Yo,72) be topological spaces, let (2, Tp) be 
the product space, and let A C Y; and B C Yo. Show that 


AxB=AxbB. 
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5.6 


hyd 


5.8 


39 


5.10 


5.11 
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Let (Y;,7;),2 =1,...,n, be topological spaces and (Q, Tp) be 
the product space. If A; C Y;,72 = 1,...,n, show that 

[J A: = TA: 

i=1 i=l 
Let (Y;,7;),2 =1,...,, be topological spaces and (Q, Tp) be 
the product space. Let A; C Y;,i = 1,...,n. Is it true that 

a({14\) =||[eA;? 

i=l i=1 

Prove Proposition 5.1. Let 

B=B,x--:x Bn: = {B, x---x B,: B; € B;} 
where B; is a base for 7;, 1 =1,...,n. Then B is also a base 
for Tp. 
Prove Proposition 5.2. Let 
S=S,xX-:-xX Sy, 

= {$1 x4 Me S,t Oo eo, t= ere 
where S; is a subbase for T; = T(S;), i =1,...,n. Then S is a 
subbase for Tp. 
Prove Proposition 5.3. Let 
S = {7*(\S;) : 5,€S8;,, += Lieueggihs 

where S; is a subbase for 7; Then S' is a subbase for 
ip =7(S), 
A map f: (X,7T) — (Y,7’) is said to be open if f..(7) C 7’. 


Show that in the product topology each projection map is open. 
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5.12 


5.13 


5.14 


5.15 


5.16 


(Hint: Use the fact that, according to Problem 3.3, Chapter 1, 
maps preserve unions. ) 


Let f: (Q,7) > (a = [vite and 7;:(2— Y;. Show 
i=1 
that the function f is continuous if and only if each 7; 0 f is 
continuous. [Hint: Show that f*(S') € 7, for every subbase 
element of rp, and then apply Theorem 4.8.] 


Let (Y;,7;) be a Hausdorff space, i= 1,...,n. Prove that 
(11%.7») is Hausdorff. 


i=1 


Let (Y:,T7r),c € X, be an indexed family of topological 
spaces and (2, Tp) be the Tychonov product space. Let A be 


an arbitrary subset of X and F), € r£,a € A. Show that || F, 
xEA 
is closed in Tp. 


Let {(Y:, Tr) : « € X} be an indexed family of topological 
spaces and let 


B= {mi( ih B.) : A's are finite subsets of X, B, € B.}, 


rEA 


where B,, is a base for 7,. Show that B is also a base for the 
Tychonov topology Tp on 22. 


Let (Y:,Tr),« € X, be an indexed family of topological 
spaces such that S, is a subbase for 7,, x € X. Show that the 
collection of all unit subbase cylinders 


S=U a (Sx) = {t7(Sz): Sz € Sx, 2 € X} 
rEX 


[in Remark 5.8(2) and Remark 5.10] is a subbase for the 
Tychonov topology. 
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5.17 


5.18 


19 


5.20 


5.21 


5.22 
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Let {(Y:,7) : a2 € X} be an indexed family of Hausdorff 


spaces. Prove that { |] Ye,7») is Hausdorff. 
vEX 


Let (2 =|] ¥, re) be the Tychonov topology and assume 
crEx 

that each factor space is first countable. Is (Q,7p) first 

countable if: 


by |X| 2 ¢? 


Generalize Theorem 5.5 for the case of Tychonov's topolo- 

gy: Let Q=]][ Y, and m,:Q — Y, be the xth projection 
ieX 

map. Then the Tychonov product topology Tp on Q) is the 

weakest topology with respect to which each projection is con- 

tinuous. 


Let (2 =|] 7. r») be the Tychonov topology and assume 
rEx 

that each factor space is second countable. Is (2, 7p) second 

countable if X is countable? 


Let (0 =|] %; r») be the Tychonov topology of (Y;, 7.) 
rex 
factor spaces. If A, C Y,,72 € X, show that 


rExX TEX 


Let (2 =|] Yu, re) be the Tychonov topology and assume 
rex 

that each factor space is separable. Is (Q., 7p) separable if X is 

countable? 
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NEW TERMS: 


b= x T;, base for product topology, 213 


product topology of finitely many factor topologies, 213 
base (simple) rectangle in product topology, 213 
simple (base) rectangle in product topology, 213 


x S;, subbase for Tp, 213 


subbase rectangle, 214 

continuity of projection maps, 215, 224 

product topology of arbitrarily many factor topologies, 215 
box topology, 215 

box rectangle, 215 

Tietze, Heinrich, 215 


( ie 7) , Tychonov product topology, 216 
rex 

Tychonov, Andrei, 216 

Cech, Eduard, 216 

open neighborhood of f, 218 

weak topology (generated by a family of functions), 219 
(Y*, rp), topology of pointwise convergence, 220 
pointwise convergence, criterion of, 220 

open map, 222 
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6. NOTES ON SUBSPACES AND COMPACTNESS 


It has been mentioned that subspaces of topological spaces (that is, 
relative topologies) inherit certain features of the original spaces. In 
this section we consider this notion more systematically. We are con- 
cerned with such topological properties as separability, countability, 
and compactness, and their impact on subspaces. 


6.1 Definition. A property of a space is referred to as hereditary 
if every subspace has this property. A property is said to be weakly 
hereditary if it is inherited by a subspace whose carrier is closed in 
the original space. A property is vaguely hereditary if it is inherited 
by a subspace whose carrier is open in the original space. (The last 
notion is restricted to the use in this textbook.) O 


6.2 Example. Second countability is hereditary. (See Problem 
ae O 


6.3 Remark. In Section 1 we denoted by A the closure of some 
subset A of a topological space (X,7), understanding that this is the 
closure relative to the topology 7. As mentioned in Definition 1.8(7), 
in the case of subspaces we may need to deal with closures of subsets 
with respect to any relative topology, say (Y, Ty). To make a certain 
distinction clear we then write C/A. However, we will still use A 
whenever we have in mind the closure relative to the original space 
(Aor O 

6.4 Example. The property of density of a set is not hereditary 
and not weakly hereditary, that is, if D is dense in (X, 7), its trace in 
a subspace (Y,7y) need not be dense. Let (X,7) = (R,7-) and 
Y =R, U{ — 2}. Then, obviously the set Q, = QNY is not 
dense in (Y,7y,). It is easily seen that { — \/2} is an open neighbor- 
hood of the point - J2 that does not meet Q,. Thus Cly QQ, 4 Y. 
Because Y is closed in (R,7.), the density property is not weakly 
hereditary either. O 


6.5 Theorem. Separability is vaguely hereditary, but is neither 
hereditary nor weakly hereditary. 


6. Notes on Subspaces and Compactness 227 


Proof. 


(i) Let (X,7) be separable and let (Y,7y) be a subspace of 
(X,7) such that Y € 7. We show that (Y,7y) is separable. Let D be 
a countable dense set in (X,7T). We need to prove that Cly(DNY) 
= Y; specifically, we need to show that Y C Cly(DNY), for the 
inverse inclusion holds trivially. Let y be any point of Y and let U, 
be any open neighborhood of y in ry. Because Y is open in X, U) is 
also a neighborhood of the point y in T. (It is easy to show the fol- 
lowing. Since U, is a neighborhood of y in ty, there is 0; C U, 
which is ty-open. But O,, = O,  Y, where Oy € 7. Because Y is T- 
open, it follows that O,, is also r-open and, clearly, U;, € Uy in 7.) 


Therefore, U;, meets D and, consequently, U;, meets DN Y (asa 
subset of Y). Observe that if Y is not open in X, U;, need not be a 
neighborhood of y in Y. [For instance, let Y = (0,2] and 
Us = (1, 2]. Clearly, U3 is not a neighborhood of 2 in (IR, 7), but it 
is a neighborhood of 2 in (Y, ty).] 


(ii) As a counterexample of separability as a hereditary property, 
we consider the topology (X,7) known as the Moore plane. Let 
X = R x [0, co) (the upper semiplane with the horizontal axis). The 
topology on X will be described by the following base sets. At each 
point (x,y) € R x (0,00), a neighborhood base is the collection of 
all open balls {B((z,y), r): r< y}, where B(z,r) ={z'EX: 
d.(z,z') <r}. At each point (2,0), a neighborhood base consists 
of all open balls touching the horizontal axis at (2,0), and the point 
(x, 0) is attached to these balls. 


Take the union of all neighborhood bases and construct a base for 
the Moore plane topology in light of Theorem 2.2. This topological 
space is separable with the dense countable subset D = Q? 7X. 
Indeed, let (x,y) € R x (0,00). Then any neighborhood of (x, y) 
contains points with rational coordinates (a property inherited from 
the Euclidean space). As for the points (x,0), any open ball 
bordering (x, 0) also contains points with rational coordinates. 


Now, for a subspace of the Moore plane, consider one with the 
horizontal axis as the carrier Y. Clearly, all singletons are traces of 
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base neighborhoods at (xz,0) in Y yielding the discrete topology as 
the relative topology on Y. According to Problem 6.2, any discrete 
topology with an uncountable carrier is not separable. Observe that 
Y* is obviously open in X. Hence the separability is not weakly 
hereditary. O 


6.6 Definition. A subset A of a topological space (X,7) is said 
to be compact (or Lindelf'‘compact or just Lindif\ _) if every open 
cover of A contains a finite (at most countable) subcover. We also 
say that A is finitely (countably) reducible. Specifically, if X is 
compact (Lindel6éf), (X,7) is called a compact topological space 
(Lindelf space). O 


6.7 Example. Compactness in metrizable topological spaces ob- 
viously coincides with that for the corresponding metric spaces. In 
this case, we may use the tools and criteria of compactness for metric 
spaces. For instance, the interval [a,b] for a,b € R” is compact in 
the sense of the Euclidean metric; thus, it is compact in (R”,7.), 
whereas (a, b) is not compact in (IR", 7-), because it is not closed. O 


Theorem 6.8 below is almost identical to Theorem 6.10 of 
Chapter 2. However, for consistency, we formulate it and prove it 
again. 


6.8 Theorem. Let f : (X,7) — (Y,7') be a continuous function 
and let K be a compact subset of X. Then the image f,(K) is com- 
pact. In short, the image of a compact set under a continuous func- 
tion is compact. 


Proof. Take any open cover {O/: 7 € I} of f,(K) to have f,(K) 
CU O; . Then, 
iel 


KC P(E) SU F(O}). 


iel 


Because f is continuous, f*(O/) is open, and because K’ is compact, 
there is a finite subcover of sets f*(O/), without loss of generality 
indexed by 1,..., n, that is, 


KCU f*(O}). 
k=1 
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Because by Problem 3.6(a,b), Chapter 1, maps preserve inclusions 
and unions, 


feK) SU fel F(OK)) = U Of. o 


6.9 Theorem. Compactness is weakly hereditary (that is, a 
closed subset of a compact topological space is compact). 


Proof. Let (X,7) be compact and let B C X be closed. Let 
{O; : 1 € I} be an open cover of B. Because B° is open, {B°,O;: 
i € I} is an open cover of X. Because X is compact, there exists a 
finite open subcover of X, say {B°,O1,...,On}, which is also an 
open subcover of B. Hence, B is compact. O 


Hausdorff topological spaces have an important property with 
respect to compactness. 


6.10 Theorem. Every compact subset of a Hausdorff space is 
closed. 


Proof. Let A be a compact subset of the Hausdorff space (X,7). 
We show that A° is open. Take x € A‘. The family of neighbor- 
hoods of all points y<€A covers A. We extract a particular 
subfamily of these neighborhoods. Because (X,7) is Hausdorff, for 
each y € A, there is a neighborhood U,(y) of x and a neighborhood 
V,(a) of y such that U,(y) NV,(«) = ©. Without loss of generality 
we may assume that the family {V,(x) : y € A} is an open cover of 
A. Because A is compact, there exists an open subcover {V,, (x) : 
k=1,...,n} of A. Obviously, V,,(2)NUz(y,) =@. Select 
{Ox(yx) © Ux(yr), k = 1,...,n}, whose intersection (denoted by 
O,,), because being finite, is open and nonempty. Therefore, O,, is an 
open neighborhood of x € A° with O, N A = @, which means that x 
is an interior point of A°. Thus, A° is open, or equivalently, A is 
closed. O 


6.11 Remark. In Theorem 6.3, Chapter 2, we stated and proved 
(Problem 6.8) the equivalence of the conditions: (i) A C (X,d) is 
compact; (iz) every infinite subset of A has an accumulation point in 
A (Bolzano - Weierstrass compactness); and (iii) every sequence in 
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A has a convergent subsequence (sequential compactness). This 
equivalence does not hold for topological spaces, where (i) and (777) 
are in general distinct properties, and compactness just implies 
Bolzano - Weierstrass compactness, as the reader proves (see Prob- 
lem 6.6). O 


Recall that second countable spaces are first countable and 
separable (see Problem 3.6). In addition, they are Lindeldf spaces, as 
the following theorem asserts. 


6.12 Theorem. Any second countable topological space is 
Lindelf.. 


Proof. Let B be a countable base of a topological space (X,7) 
and let O= {O;: i € I} be an arbitrary open cover of X. Let 
x € X. Then x belongs to some O, € O. Because B is a base for 7, 
by Theorem 2.2, there is a neighborhood base B,, C B. Then there is 
a base neighborhood B, € B, such that B,C O,. The collection of 
all distinct B,'s for all x © X is at most countable. On the other 
hand, this collection obviously covers X. Consequently, the 
collection of all open supersets {O,,} associated with each B, is also 
countable and it covers X. Thus (X,7) is indeed Lindel6f. CO 


The result below is in the spirit of Theorem 4.8, where, for conti- 
nuity of a function f from (X,7) to (Y,7(S)), it was sufficient to 
verify that f *(S) C r. Here we claim that, if S is a subbase for 7, 
then X is compact whenever every cover of X by elements from S 
can be reduced to a finite subcover. 


6.13 Theorem (Alexander Subbase Theorem). Let (X,7) bea 
topological space and let o be a subbase for T. If every open cover of 
X by elements of o is finitely reducible, that is, if every open cover 
can be reduced to a finite subcover, then X is compact. 


Proof. We prove the equivalent statement. Jf X is not compact, 
then there exists an open cover by elements of o that is not finitely 
reducible. 


Assume that X is not compact. We prove this assertion in four 
steps, which we outline as follows. 
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(i) Let O be the collection of all open covers of X that cannot 
be reduced to finite subcovers. We show that O has a maximal 
element; call it M. 


(ii) We show that for every x € X, there is an open set M, € 
M and a finite tuple of open sets {S)(x),...,S,(x)} C o such that 


x €()S_(a) C My. 
k=1 
(iti) We show that at least one of the sets {Sj(),...,5,(x)}, 
denote it S(x), belongs to M. 


(iv) We recognize that for each rE X, S(x)€o and 
S(a) € M. In particular, the latter implies that {S(x) : x € X} is 
an open cover of X, which is not finitely reducible. On the other 
hand, because we will have {S(a): 7 € X}Co, the proof be 
complete. 


We are be concerned with each of the above steps in detail. 


Step (7): Because X is not compact, O is not empty. Introduce on 


O the partial order relation in terms of the inclusion. (In other words, 
two open covers, C; and C2 of X from O are related as C; C C2 if 
and only if C; is a subcover of the cover C2.) Let C C O be any 
chain, and let U/ be the union of all elements of C. Clearly, U/ is a 
cover of X that cannot be reduced to a finite subcover. Thus, 7/ must 
belong to O, and U/ is an upper bound of C. By Zorn's Lemma 4.13, 
Chapter 1, there is at least one maximal element in O; denote it by 


M. 


Step (12): Let x € X and let M, € M such that zx is an interior 
point of M,, (which exists, for M € O is an open cover). On the 
other hand, by Theorem 2.7, the collection 6 of all finite inter- 
sections of elements of the subbase o is a base B for rT. Thus, as an 
open set, M/, is a union of base elements, each one of which is a 
finite intersection of elements of a; that is, x belongs to one of the 


n 
base elements B,,, represented by a finite intersection () S;(a) of 
k=1 
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subbase elements. In other words, there is a tuple {5i(x),..., 


Sn(z)} C o such that x € () S;, (x) C M,. 
k=l 


Step (772): Assume that for the given set /, there is no element 
of the tuple {S,(a),...,5;,(2)} which is an element of M. In this 
case, for each S;(a) € {Si(x),...,S,(a)}, there is a finite subcol- 
lection {M,(k),...,M;,(k)} from M that supplements 5;,(x) to a 
cover of X. If no such finite subcollection were to exist, then 
{S;(a)} UM would be an element of O, that is, {.S;(a)}UM 
would not be finitely reducible, and M would not be a maximal 
element of O. Hence, {M4,(k),..., M;,(k), S4(x)}, k =1,...,n, is 
a finite open cover of X. By Problem 6.9, 


{My(k),...,Mj,(k):& =1,...,n, and 1 Sel2)} 


is also a finite open cover of X. This implies that 
{My (k),...,M3,(k): k= 1.s09 7, and M,} 


is a finite cover of X, and is also a finite subcover of M, which 
contradicts the property that M is an element of O. Thus, our 
assumption about {S(a),...,5;,(a)} was wrong and there is at least 
one set S(x) € {5)(x),...,S,(a)} that belongs to M. 


Step (2v): It follows that, for each x € X, there is an element 
S(a) common to o and M, and thus the collection {S (x): « € X} 
is an open cover of X that obviously cannot be reduced to a finite 
subcover. (Observe that the assumption that X is not compact 
implies that X cannot be finite.) O 


The Alexander subbase theorem leads to the following meaning- 
ful result by Tychonov. 


6.14 Theorem (Tychonov). 4 nonempty Tychonov product 
(2 =|] 75 r») is compact if and only if each factor space (Y,, Tx) 


vExX 
is compact. 
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Proof. 


(i) If the Tychonov product is compact, then compactness of 
factor spaces follows from the continuity of projection maps (see 
Problem 5.19 and Theorem 6.8). 


(ii) Let (Q,7p) be the Tychonov product of compact spaces 
(Y:,;Tr), « € X. Take for a subbase for rp the collection of all 
simple cylinders S = {m*(O,): Ox € Tr, x € X} (see Proposition 
5.3). If Q is not compact, by the Alexander subbase theorem, no 
subcollection of S covering (2 can be reduced to a finite subcover. 
Specifically, S itself cannot be reduced to a finite subcover. Let 
{O,} be an arbitrary open cover of Y,. Then it can be reduced to a 
finite: subcover, {O,(1),..<;O,(k,)} of Yq. Thus, {a2(O1)), x, 


oa 


1*(O,(kx))} is a finite open cover of (2. that is a finite subcover of 


x 


the largest cover, S. This contradicts the hypothesis that Q is not 
compact. O 
PROBLEMS 


6.1 Is the property of a set to be dense vaguely hereditary? (Con- 
sider Example 6.4.) 


6.2 Let (X,P(X)) be a discrete topological space with an un- 
countable carrier. Show that the space is not separable. 


6.3. Show that the topological space in Problem 6.2 is not second 
countable. 


6.4 Show that first countability is hereditary. 
6.5 Prove that the Moore plane is not second countable. 


6.6 Prove the statement. Every compact topological space is 
Bolzano - Weierstrass compact. 
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6.7 


6.8 


6.9 


6.10 


6.11 


6.12 


6.13 


6.14 
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Let 7 be the cofinite topology on an arbitrary nonempty set X. 
Show that (X, 7) is compact. 


Show that any bijective continuous map f: (X,7) — (Y,7), 
where (X,7) is compact and (Y,7’) is Hausdorff, is a 
homeomorphism. (Hint: Make use of Proposition 4.3.) 


Let {Mi (k),...,M;,(k), S,} be a cover of a set X for each 
k =1,...,n. Show that {M)(k),...,Mj,: k=1,...,n, and 


() Si} is also a cover of X. 
k=l 


Let (X,7) be a Hausdorff topological space. Show that an 
arbitrary intersection of compact sets in X is compact. 


Let O be an open set in R”. Show that there is a monotone 
increasing sequence {O;} 1 of open bounded subsets of O 
such that {O;}7 O. 


Is Hausdorffness hereditary in a strong or any weak sense? 


Let (X,7) be a Hausdorff space and C' and Kk be disjoint com- 
pact sets. Show that there are disjoint open supersets, U and V, 
of C' and K, respectively. 


A topological space (X,7) is called countably compact if any 
open countable cover of X has a finite subcover. Prove that the 
following are equivalent. 


(2) (X,7) is countably compact. 


(72) Each countable family of closed sets in (X,7) with the 
finite intersection property (that is, the intersection of 
any finite subfamily is nonempty) has a nonempty 
intersection. 
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(zi7) Every countably infinite subset A of X has a point x 
with the property that each neighborhood of x contains 
infinitely many points of A. 


(zv) Every sequence in X has a closure point. 


6.15 Let K be the family of all compact subsets of a topological 
space (X,7). Under the formation of which set operations is KC 
closed? Give counterexamples of those set operations, which 
are not contained in K. 
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Chapter 4 


Measurable Spaces and Measurable 
Functions 


In the previous chapter we studied general topological spaces. A 
topology was defined as a collection of sets (on a carrier) that is 
closed with respect to the formation of arbitrary unions and finite in- 
tersections. In the present chapter, we introduce various classes of 
sets similar to topological spaces but serving other purposes. One of 
them prepares the reader to another part of analysis, integration. Be- 
yond the familiar integration we experienced in calculus, we need to 
measure much more general sets than those used for the Riemann 
integral. For instance, we consider abstract sets that are encountered 
in the theory of probability. In addition, we largely extend the ex- 
isting class of integrable functions. 


If we try to measure the length (or area) of all sets, set theory 
drives us into certain contradictions or paradoxes. Therefore, we 
have to restrict our attention to measuring a subclass of sets. It stands 
to reason that we would want the collection of “measurable” sets to 
be closed under certain operations such as union, complementation, 
and intersection. Thus we seek a collection of sets satisfying certain 
algebraic properties under the binary operations of union, intersec- 
tion, and set-theoretic difference. This leads to the concept of a sig- 
ma-algebra. 


As with topological spaces, where base (or sometimes also 
subbase) sets were most convenient to study, in measure theory it is 
also useful to start with more primitive collections of sets called 
generators of sigma-algebras. For instance, if we need to measure a 
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flat closed figure, one of the reasonable ways to render it is to appro- 
ximate the figure by a number of specified (disjoint) rectangles 
whose measures we already know. Such a natural way of measuring 
more complex sets by “base” sets gives rise to the extension of meas- 
ure from the collection of “abstract rectangles” to the set of all 
figures formed from rectangles under countable set operations. This 
way of extension was formalized by the German mathematician 
Constantin Carathéodory in 1918. 


The present chapter is just preparation for the forthcoming two 
chapters, where we are concerned with various classes of sets on 
which measure will be defined and then extended. We will be 
introduce to some generators of these classes, in particular, topo- 
logies. The latter are of special interest. 


1. SYSTEMS OF SETS 


1.1 Definition. Let ( be an arbitrary set and let &’ be some 
collection of subsets of 2. 


(i) is called a c-algebra (pronounced “sigma-algebra”) or 
o-field (sigma-field) if: 


(a) QE DX. 

(b)AE LX => APES. 

(c) For any sequence {A,,} of sets of ©’, U A, € &. 
1 


n= 


(ii) 7 is called an algebra (or field), also denoted by A (that 
is, 3 = A)if 


(a)QEA. 

b)AECE > AEA, 
k 

(c) {Adj ang Agt CY => U An EA. 
n=1 


(itz) 3 is called a Dynkin system, denoted by D (X’ = D), if 
(a) QE D. 
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(b)AED => AED. 


(c) For any sequence {A,,} of pairwise disjoint sets of D, 


Sion =D: 
(iv) is called a ring, denoted by R (’ = R), if 
(a)OER. 
(b)A,BER => A\BER. 
(c)A, BER > AUBER. 
(v) is called a semi-ring, denoted by S (7 = S), if 
(a) OES. 
()A BES > ANBES. 


(c) for A,B eS, there is a finite tuple C),...,Cy of 
pairwise disjoint sets from S such that A\B can be 
represented as the union yas 


(vi) + is called a monotone system, denoted by M (X) = M), 
if: 


(a) for every {A,}{ (that is, monotone nondecreasing) 
sequence of sets of M, 
LJ Ane M. 
n=1 
(b) for every {A,,} | (that is, monotone nonincreasing) 


sequence of sets of M, 
CO 


An € M. 
—aii 


(vii) M-stableifA,BeX > ANBesD. 


(viit) A pair (Q, %’), where X’ is a o-algebra in Q, is called a 
measurable space, and elements of 5’ are called measur- 
able sets. (Compare these with topological space and open 
sets, respectively. ) O 
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1.2 Examples. 


(i) Let Q be an arbitrary set. Then {,@} is the smallest 
algebra (o-algebra). 


(it) P(Q), the power set, is always a o-algebra. It is the largest 
o-algebra in (2. 


(iit) The smallest o-algebra containing a set A is obviously 
{0Q,@, A, A°}. 


(iv) Let 2 = R” and let S be the system of all n-dimensional 
half-open intervals (or rectangles) of type (a,b], for a,b € R". The 
intersection of two intervals is either @ or again an interval. The dif- 
ference of two intervals need not be an interval, but it can be 
represented as a finite union of pairwise disjoint intervals (see Figure 
1.1). Hence, S is a semi-ring. 


Xo 
LLL 


(v) Every o-algebra »’ has three properties directly following 
from Definition 1.1(7): 


a) @ € XY? (because @ = 1°). 


b) For every sequence {A,} C X’, (] An € &’. This holds 
n=1 
because 


{An} CE > {AFD s 


Aes s (U 4s)" ¢ Se aes. 


n=1 n=1 n=1 
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Observe that by applying De Morgan's law we can similarly show 
that this property and property (c) in Definition 1.1(7) are equivalent, 
that is, in 1.1(i, c) a countable union can be replaced with a 
countable intersection. 


c) Finally we have 
A,Be xi => A\BeE » (dueto A\B= ANB’). 


One can say that any o-algebra is closed with respect to the form- 
ation of all countable set operations. 


(vi) Every algebra A has the same property as o-algebras in (v) 
except it is closed under finite intersections. Hence, any algebra is 
closed relative to the formation of all finite set operations. 


(vii) Let 2 be an arbitrary set and let &’ be the system of all 
subsets A of »’ such that either A or A‘ is at most countable. Then 
+? is a o-algebra. Indeed, @ is at most countable. Thus, 2 € ’. If 
Aeé S’ then obviously A° € 3’. Now let {A,} C X’. Then either 
there is at least one countable set A;, or else all A, are not countable 


[e.¢) 
but their complements are countable. In the first case, (] A, is 


n=1 
[ee] 
clearly countable. In the second case, the set |) A® is at most count- 


n=1 


able and, therefore, it belongs to 5’, along with its complement. The 


latter, by De Morgan's law, is () A,. Consequently, »’ is a o-algebra. 
n=1 

(viit) Let f: Q — Q’ be a map and %” a o-algebra in 2’. Then 
3} = f**(2’) defined as the system of all sets {f*(A’): A’ € &”} is 
a o-algebra. This property is due to the fact that the inverse of a map 
preserves all set operations. For instance, if A € 3’, then it follows 
from the definition of ’’ that A is the inverse image of some 
A’ € X”. Thus (A’)° € &” yielding A° = (f*(A’))° = f*((A’)°) € 
+’. The proof that the union of a sequence from »’ belongs to »’ is 
similar (show it). 


(ix) A monotone system need not be an algebra. For instance, let 
€ be the set of all convex subsets of R?. Then it is easily seen that € 
is a monotone system. However, the union of two convex sets A U B 
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need not be convex. The difference of two convex sets need not be 
convex either. On the other hand, an algebra need not be a monotone 
system, for it is not closed under countable set operations. 


(x) The collection of all finite subsets of an infinite set 2 is a 
ring in). O 


PROBLEMS 


1.1 Let »’ be a o-algebra in 2) and let 1)’ be a subset of 2. Define 
Sas VN = {ANM: Ae LX}. Then &” (the trace of 5 
in ’) is a o-algebra in 1)’, called a trace o-algebra . Prove it. 


1.2. Show that any ring is closed under the formation of finite inter- 
sections. 


1.3 Let R bea collection of subsets of (2 with the properties: 
®eRandA,BER > ANBE Rand AUBE R. 
Is R a ring? 


1.4 Let A be an algebra in 22. Prove that A is a o-algebra in (Q if 
and only if A is also a monotone system. 


1.5 The flowchart below reflects the relations between some 
systems of sets 


D 


tt 
+ =>AxsR=S => N-sstable 


4 
M 


Demonstrate that each relationship holds. 
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1.6 


1.7 


1.12 


Give an infinite collection of subsets of R that contains @ and 
R and which is closed under countable unions and inter- 
sections but is not a o-algebra. 


Let ( be a finite set with |Q| = 2n. Let D be the system of all 
subsets D of Q such that |D| = 2q, q = 0,1,...,. Show that 
D is a Dynkin system. 


Give a Dynkin system that is not a o-algebra. 


Show that if D is a Dynkin system then D, EF € Dand E C D 
=> D\E ED. 


Prove the statement: A Dynkin system D is a o-algebra if and 
only if Dis 1 -stable. 


Show that the inverse image of a ring 7# under a map 
f :Q) — OQisaring inQ). 


Show the equivalence of two definitions of a semi-ring if 
property c) in Definition 1.1(v) can be replaced with 


c’) Let A, Aj,...,A,€S. Then there is a finite tuple 
Ci,..., Cy of disjoint sets from S such that 


A\ UAs =o C, 
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2. SYSTEMS' GENERATORS 


This section is concerned with generators of o-algebras which is 
somewhat analogous to the way how subbases and bases generate to- 
pologies. However, in the case of o-algebras, there will be more 
varieties of generators including topologies themselves. We begin 
with the following theorem. 


2.1 Theorem. The intersection of arbitrarily many o-algebras 
(algebras, monotone systems, rings) in Q is a o-algebra (an al- 
gebra, monotone system, ring). O 


(See Problem 2.1 .) 


2.2 Remark. Let G be an arbitrary collection of subsets of (2. 
There is obviously a o-algebra containing G, for instance, the power 
set P(Q). If we collect all c-algebras that contain G and find their 
intersection, it must contain G and, due to Theorem 2.1, it is a o- 
algebra too. This o-algebra is clearly the smallest one containing G. 
It is called the o-algebra generated by G and it is denoted by »/(G). 
The system of sets G is called the generator of 3’(G). It is worth- 
while to recollect the analogue of a subbase or base and their role as 
generators of the smallest topology. Although, as we learned, the 
classes of generators of topologies are somewhat limited, their coun- 
terparts for o-algebras are more functional and form a significantly 
richer reserve filled with such prominent systems as semi-rings, 
rings, Dynkin systems, and monotone systems. Among them, rings 
and semi-rings are often used as generators of o-algebras (throughout 
this book, especially, in Chapter 5) in Carathéodory construction of 
measures. Another frequently used generator is a topology itself that 
we see in action regarding a characterization of regular and Radon 
measures [Dsh2] and in calculus of Lebesgue - Stieltjes measures. 
The smallest c-algebra containing a topology 7 as a generator is 
called a Borel o-algebra and it is denoted B(r) or by B(Q) or just B 
whenever the nature of 7 is specified. Of the various Borel o- 


algebras we come across, some are generated by the usual topology. 
O 
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2.3 Example. Let G be the system of sets containing only one 
subset A of 2. As mentioned in Example 1.2(iii), the smallest o- 
algebra containing G is {Q, @, A, A‘}. O 


Problem 1.10 states that a Dynkin system is a o-algebra if and 
only if it is M-stable. The proposition below generalizes it by 
allowing the Dynkin system to have just a M -stable generator. 


2.4 Proposition. 4 Dynkin system is a o-algebra if and only if it 
has an () -stable generator. 


Proof. Let G be an f-stable system of subsets of ({. Then 
S(G) = D(G). Because every o-algebra is a Dynkin system and 
D(G) is the smallest Dynkin system containing G, D(G) C 57(G). 
The inverse relation remains to be shown: 


Let D € D(G) and let 


Dp = {QE P(Q): QNDE DG}. 


a) We show that Dp is a Dynkin system. 
If A € Dp then AN D € D(G) and 


Aon D=D\A=D\(AND)€ DG) 


(see Problem 1.9). This yields A° € Dp. Similarly, let {A,,} C Dp 
be a sequence of pairwise disjoint sets. Then 


A, 1D € DG), form = 1,2)<20 
Obviously, {A,, ™ D} is a sequence of pairwise disjoint sets and 
ey AnD = (SE, An) ND € DE), 


implying that Sear Ay € Dp. Therefore, Dp is a Dynkin system. 
b) We prove that for every D € D(G), D(G) C Dp. 
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Let G € G. Then G € D(G). In as much as G is  -stable, it follows 
directly from the definition of Dg that GC Dg. Thus G C D(G) 
C Dg, since D(G) is the smallest Dynkin system containing G and 
Dg is just a Dynkin system containing G. Now let D € D(G). Then 
DeéDe¢ and GNDeED(G), implying that G € Dp or GC Dp. 
This yields that D(G) C Dp, because again D(G) is the smallest 
Dynkin system containing G. 


c) We show that D(G) is M -stable. 


Let C,DED(G). Then D(G) C Dp and C,DEDp. Thus, 
Cm D € D(G) (by the definition of Dp), and therefore D(G) is N - 
stable. Finally, by Problem 1.10, D(G) must be a o-algebra. Then, as 
the smallest o-algebra, '(G) C D(G). This is the desired inverse 
relation. O 


In the next lemma and theorem we present a construction of the 
ring generated by a semi-ring. 


2.5 Lemma. Let S be a semi-ring in Q and let R be the system of 
all finite unions of elements from S. Then any element of R can be 


represented as a finite union of pairwise disjoint sets from S; in 
notation, )\;_,Ck, Cy € S. 


Proof. Let R € R. Then by the definition of R, R= US;, 
k=l 


where 5S; € S. We now construct a decomposition of R by elements 
of S using S;,. Let 


Ry = Si, Ro = So\Si, Rs = [$3\$1] 1 [$3\S9], «-- , 


k-1 
He = [) sels = luccsg he 
jel 
Because S;\5; = >> Ciz j is a finite union of elements from S, it 


follows that 


kl k-1 
R, = N) iCing =U Cry 
= 


a a j= 


(as a finite union of finite intersections) 


250 CHAPTER 4. MEASURABLE SPACES AND MEASURABLE FUNCTIONS 


= Dix, 


where D;;, are elements of S. It is easily seen that 


i= |) Se= |) Re, 
k=1 


k=1 

where FR), are pairwise disjoint. This leads to R= >> 5) Di, and 
k=l i 

the lemma is proved. O 


2.6 Theorem. Let S be a semi-ring in Q. Then the system of all 
finite unions of sets in S is the (smallest) ring R(S) generated by S. 


Proof. 
1) We show that R described above is a ring. Because S C R, 
we have 0 € R. Let Ri, Ry € FR. Then, by the definition of R, 


= US}, Fa = US. 


Therefore, 


RiUR, =U U(S}US?) ER. 
k=1 i=1 


By Lemma 2.4 and by Problem 1.2(c) (Chapter 1), we have 


Ri\R2 = ( taCk)\(SDi) 


m n mm 


7 if (25 (Ce\D))) 7 ee 


Because C; and D;, as elements of S, are semi-ring sets, the sets 
Vik : : 

C;,\D; = >> E** and E* are also elements of S. Therefore, 
s=1 


n mM Tik 


R\Ra= Yo f\ LEH = 3 3 (EMER. 
k=1 s=1 i=1 


=1 i=l s= 


2) Now we show that R = R(S). Let S C R’, where FR’ is any 
ring in 2. As a ring, R’ is closed with respect to the formation of 
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finite unions of sets from 7’. Specifically, it is closed under finite 
unions of sets from S; hence, it includes #. Consequently, R is the 
smallest ring generated by S. O 


In Remark 2.2, we defined a Borel o-algebra as a o-algebra 
generated by a topology. We show below that the smallest o-algebra 
5/(S(R")) generated by the semi-ring of all half-open intervals (a, 5] 
in IR” coincides with the Borel o-algebra B(IR”). 


2.7 Theorem. Let 7, be the usual topology on R" and Tf, and 
K. denote, respectively, the systems of all closed and compact sub- 
sets of IR". Then the following relations hold. 


BOR"): = 3(r.) = Dr) = D(K.) = 5(S(R). 


€ 


Proof. Because all compact sets in (R",7.) are closed and 
bounded, it follows that 


Ke c i. C 2 (te); 
and thus 

B(K.) © U(r?) 2.7) 
On the other hand, every closed set F’ can be represented as a count- 
able union of compact sets Ci, € 7, k = 1,2,... . For instance, if 


C(c,k) denotes the compact ball centered at some point c and with 
radius k € N, then we may choose Cy, = FM C(c,k) implying that 


r= ie C;,. Therefore, all closed sets belong to the o-algebra »’(K,) 
(because this c-algebra contains countable unions); that is, T° C 
3)(K.) which yields 

E(re) C E(K). (2.7a) 


Both inclusions (2.7) and (2.7a) lead to '(7°) = 3'(K.). Because 
open sets are complementary to closed sets, it follows that 


B= X(te) = E(7e) = Ui (Ke). 
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Now we show that B = ¥7(S). Any half-open interval (a, b] in R” 
can be represented as the intersection of a sequence of bounded open 
intervals of type (a,b,) (or as we called them earlier, open rect- 
angles) such that b,, | b. Therefore, the collection S of all half-open 
intervals belongs to the c-algebra ’(7.), which implies that 37(S) 

C »(7-). On the other hand, any open bounded interval can be re- 
presented as the union of a sequence of half-open intervals of S; and 
any open set is a countable union of bounded open intervals as base 
sets (recall that (R",7.) is second countable). Therefore, any open 
set is the union of countably many half-open intervals from S and we 
have t. C 37(S), implying 1(7-) C ©’(S). Dual containment yields 
JS) =D; O 


PROBLEMS 


2.1. Prove Theorem 2.1. The intersection of arbitrarily many o- 
algebras (algebras, monotone systems, rings) in Q is a o- 
algebra (an algebra, a monotone system, a ring). 


2.2 Show that an intersection of semi-rings in (2 need not be a 
semi-ring. 


2.3. Show that a union of o-algebras in 1) need not be a o-algebra. 


2.4 Let A and B be nonempty subsets of Q and let G = {A, B}. 
Find D(G) and 3'(G). Show that D(G) and 27(G) are identical 
if and only if one of the following conditions holds. 


AN Bor An B* or A° ON Bor A° N B* is the empty set. 
(a) Use Problem 1.10 to justify it. 


(b) Use Proposition 2.4 to justify it. 


2.5 Let »’ be a o-algebra in (2 and let B C (2. Show that the o- 
algebra generated by G = 5’ U {B} is of the form 


Sa Aca hts by AA eo 
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2.6 


2.7 


2.8 


2.9 


[Hint: 1) Show that o(”’) = o(S’U {B}). 2) Show that &” 
is a o-algebra in 2.] 


Let G, and G2 be systems of sets in (2. Show that G, C G 
implies that ©(G,) C ¥7(G2). 


Let Q be an arbitrary nonempty set and let A,BCQ. 
Construct for the topology 7 = {0,0,A,B, ANB, AU B} 
the Borel o-algebra B(rT). 


Construct the Borel o-algebra generated by the cocountable 
topology Tt = {0,@, A° : A is at most countable} (see Prob- 
lem 1.9, Chapter 3, where (2 is an uncountable set). 


Show that any open set in IR” can be represented as at most a 
countable union of disjoint half-open cubes. 


254 CHAPTER 4. MEASURABLE SPACES AND MEASURABLE FUNCTIONS 
NEW TERMS: 


o-algebra generated by a collection of sets, 247 
generator, 247 

B(r), Borel c-algebra, 247 

o-algebra generated by a set, 248 

semi-ring, property of, 249 

ring generated by a semi-ring, 250 

B(R"), Borel c-algebra in Euclidean space, 251 

Borel o-algebra generated by a cocountable topology, 253 


3. Measurable Functions 255 


3. MEASURABLE FUNCTIONS 


Measurable functions which we introduce in this section are 
analogues of continuous functions f operating between two topol- 
ogical spaces, say (X,7) and (Y,7’). Recall that continuity of f was 
first defined at a point and then on X followed by a theorem that f 
was continuous if and only if f**(7’) Cr. In the present case, a 
function acting between two measurable spaces (Q, ¥’) and (’, &”) 
is regarded as measurable if f**(37") CX’. 


3.1 Definition. Let (Q, 5’) and (9, 2”) be two measurable 
spaces. A function [Q, ’, f] is called measurable if f**(2”) CX, 
that is, if VA’ € &” f*(A’) € ’. The collections of all measurable 
functions from (2,57) to (Q', 2”) are denoted by the symbol 
eA, 220! 2, 


Notice that symbol C~! is a natural extrapolation of the common 
notation in analysis, where C” stands for the space of all n-times 
continuously differentiable functions, with C° (or simply C) standing 
for the space of all continuous functions. So, not only has C~! been 
vacant, but it also agrees with the existing linear order (C",n = 
= 0, Vy naw O 


3.2 Remark. In Example 1.2(viii), we saw that f**(”) is a o- 
algebra in (. We wish to call it the o-algebra generated by function 
f. This is the smallest o-algebra relative to which f is measurable. 
Recall that if 7’ is a topology on 1)’, then f**(7’) is the weakest 
topology on (2 relative to which f is continuous. O 


3.3 Examples. 


(i) Each identity function f: (Q, 27) — (Q, &”) is measurable if 
and only if 3” C ¥. 


(iz) Let f: (Q, 27) — (Q', 2”) be a constant function, that is, 
fw) =cE, Vw EQ. Therefore, f*({c}) =Q and f*({c}°) = 
@, which yields that for each A’ € X’, f*(A’) is either Q or O. The 
latter implies that f is measurable with respect to the smallest o- 
algebra {Q, @} in Q. Thus, f is always measurable. 
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(iti) Let f =1,4 for some A CQ. Let &” be an arbitrary o- 
algebra in R (for instance, the Borel o-algebra). It is easily seen that 
the inverse image under f of any subset of R (specifically, of any 
subset of 7”) is one of the elements of the set 3’ = {0, 0, A, A‘}. 
Therefore, 5’ is the smallest c-algebra with respect to which f is 
measurable. On the other hand, if »’ is a o-algebra in 2, then 1, is 
measurable if and only if A € »’. O 


There is a noteworthy parallel between continuity and measur- 
ability of functions and their relationships with topologies and o- 
algebras mentioned in the beginning of this section. Recall that a 
function [Q, 2’, f] is continuous on (2 if there are two topologies 7 
and r’ declared on 2 and 1)’, respectively, and f**(7’) Cr. If, in 
addition, 7’ is known to be induced by a subbase G’, then the 
condition f**(7’) C 7 can be relaxed by f**(G’) C 7, as per Theorem 
4.8, Chapter 3. The pointed-out analogue of continuity with measur- 
ability is utilized by the following proposition. 


3.4 Proposition. Let (Q, ©’) and (Q’, =") be two measurable 
spaces and let G' be a generator of 5". Then a function f: Q — Q is 
measurable if and only if f**(G') CX. 


Proof. Let 
B= {Qo : fr(Q’) € Th. (3.4) 
It can be shown that ©’ is a o-algebra (Problem 3.1), which 


resembles the quotient topology. Now let f**(G’) C »’. Then from 
(3.4) it follows that G’ C 3) and hence by Problem 2.6, 


Sy arGyes. 
Therefore, f is measurable. Obviously, >’ is largest o-algebra with 
respect to which function f is measurable. The converse is trivial. O 
3.5 Examples. 


(i) Let f: (Q,7) — (’,7') be a continuous function on a 
topological space (0,7). Then f**(r’) Cr C Bir) (the Borel o- 
algebra generated by 7). By Proposition 3.4, the function f is then 
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B(r)-B(7r')-measurable. Here f is an example of a Borel measurable 
function. 

(iz) In a more general setting, f : (02, B(7)) > (’, B(7’)) is 
called a Borel measurable function or just Borel function, if 


f"(Blr')) © Bir). 


Now, by Proposition 3.4, f is Borel if and only if f**(7’) C B(7), 
that is, if the inverse image of an open set is a Borel set. 


(iti) Let f : (R", B(R")) — (R”, B(R")) be a function. Because 
by Theorem 2.7, the Borel o-algebra B(IR”) has a variety of genera- 
tors, including 7,75,K.,S, to see whether f is Borel, we can 
examine if f**(-) lands at B(IR”) with one of these families. The 
most common family used for verification is the semi-ring S of half- 
open rectangles. 


For example, let f = 1g (the Dirichlet function). Obviously, Q is 
a Borel set, because Q is the countable union of the singletons 
representing all rational points. Each one of them is a closed set, 
because 7. is Hausdorff. If (a,b] € S (a < b), f*((a,b]) is either 
Q, or Q, or Q°, or R. Hence the Dirichlet function is Borel. i 


Measurability, like continuity, is preserved under the compo- 
sition. 


3.6 Proposition. Let f,: (01, ©) > (Qe = fi,(Q1), ©) and fr: 
(Q2, 32) — (OQ3, 13) be measurable functions. Then the composi- 
tion fr © fy : (Q4, 2) 3 (OQ3, 73) is measurable. O 


(See Problem 3.2.) 


3.7 Remark. Let {0;, 0;: i € I} be an arbitrary collection of 
measurable spaces and let {f;: Q > Q;:7€ I} be a collection of 
functions defined on a set (2. Every function f; of this family is 
clearly f;*(2/;)-2);-measurable. We are interested in constructing the 
minimal o-algebra in (2 relative to which all functions of the family 


are measurable. Since ) f*(2;) is not, in general, a o-algebra, it is 
tel 
reasonable to regard it as the generator of the o-algebra generated by 
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the family {f;; i € I}, in notation, »7(f;;i € I). Recall the analogue 
with continuous functions of Remark 5.10, Chapter 3. O 


3.8 Lemma. Let {g;: (0,57) — (Q;, ;)} be a collection of 
functions on Q and let f : (Qo, Xo) 3 (Q, 2’) be a function on Qo. 
The function f is Sig-57(g;:i € I)-measurable if and only if each of 
the functions g; © f iS 3/o-57;-measurable. 


Proof. 
1) Let g, o f be Xo-%,-measurable Vk € I. Then VA;z € Xi, 
(9.0 f)*(Ak) = f* ogi (An) € Xo, 
where g.(Ax) € U g;(©;). Taking A; from %, for each respective 
k © I we run ‘iene set LU g?*(;) whose elements are further 


ieLl 
transferred by f* into Xp. In other words, we have 


FF" U 95 (2%) & %o. (3.8) 


ie 


Because [J g;*(27;) is a generator of X'(g;; 7 € I), according to 
iel 

Proposition 3.4, inclusion (3.8) is sufficient for f**(2°(g;;7 € [)) 

C Xo. Therefore, f is indeed ’p->’(g;;i € I)-measurable. 


2) Let f be Xo-%'(g;;i € I)-measurable. This implies that 
VE EG=U gi (Xi), f(E) € Xo. 


ie 


Besides, VA; € X, , 


9: ( An) € U gi (2). 


ie. 


Thus, VA, € ©, (gx 0 f)*(Az) € Xo, which means that g; 0 f is 
djo->/,-measurable. i 


3. Measurable Functions 259 


PROBLEMS 


3.1 


3.2 


3.3 


3.4 


3.5 


Show that 5? in the proof of Proposition 3.4 is a a-algebra. 


Prove Proposition 3.6: Let fi: (Q1, 1) > (Qe = fir.(O1), 22) 
and fx: (Qe, 4) — (Q3, 3) be measurable functions. Then 
the composition fz ° fy : (Q1, X41) — (Q3, 3’3) is measurable. 


Let f :Q — 2 bea function and let G’ C P(’). Show that 


FBG) = SFG). 


[Hint: Let 3: = {A’ € P(O’): f(A!) € U(f*(G’))}. Show 
that ¥’ is a c-algebra. Then show that 5(G’) C 57] 


Let [O,O,, F] be a homeomorphism, with O and Oj being 
open sets in topological spaces (X,7) and (X,7,), respective- 
ly, and let B(79) and B(7¢,) be the Borel o-algebras generated 
by the relative topologies 7T> and 70,. Prove that [B(70), 
B(To,), F] 1s bijective. 


Let [O,O,, F'] be a homeomorphism, with O and O, being 
open sets in topological spaces (X,7) and (X, 71), respective- 
ly, and let B(7o) and B(7o,) be the corresponding Borel o- 
algebras generated by the relative topologies 79 and 70,. Sup- 
pose B C O. Show that if F,(B) is Borel, then B is also Borel. 
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Chapter 5 


Measures 


This chapter is a precursor to the general theory of integration, which 
is a significant advancement from the Riemann integration known 
from calculus. Although many applications in the natural sciences 
triggered the development of general integration and measure theory, 
the theory of probability has become the primary client of abstract 
measure even prior to integration. 


An early notion of measure was introduced by the Italian 
Giuseppe Peano (1858 - 1932) in 1883. For a simple set in the plane, 
Peano used two types of polygons that contain and are included in 
the given set. The areas of the polygons of the former type have a 
greatest lower bound and of the latter type, the least upper bound. If 
these limits coincide, their common value is said to be the area of 
the set. However, if the limits differ, the concept of area would not 
apply. Apparently, Cantor's development of set theory greatly 
influenced Peano's concept of area for arbitrary sets in his 1887 
monograph, Applicazioni geometriche del calcolo infinitesimale. He 
generalized his original idea on inner and outer measures of sets by 
polygons for two- and three-dimensional Euclidean sets. Peano 
emphasized the close connection between measure and integral. 


In 1892, the Frenchman Camille Jordan (1838 - 1922) arrived at a 
more advanced concept of measure as a countably additive set func- 
tion applied first to positive and then to signed set functions. The 
latter led to the prominent Jordan decomposition of a signed measure 
in two positive, which we will study in Chapter 6, section 6. Jordan's 
motivation of the concept of measurable sets apparently stemmed 
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from the theory of double integration, which naturally arises when 
introducing integrals on arbitrary plane sets. However, Jordan's 
approach to the measurements of sets was restricted to the common, 
at that time, finite covers of sets by intervals or rectangles. 


The most revolutionary steps were undertaken by the Frenchman 
Emile Borel (1871 - 1956) in his famous 1898 monograph, Lecons 
sur la theorie des fonctions, where he introduced the idea of 
countable, instead of finite, covers, thereby significantly extending 
classes of measurable sets. Borel also pointed out in 1905 to a 
possibility of using measure theory in probability, which was suc- 
cessfully accomplished by the Russian Andrey Kolmogorov (1903 - 
1987) no earlier than 1933. However, in his Leéons, Borel did not 
bother to connect measure and integration. 


In 1902, another Frenchman, Henri Lebesgue (1875 - 1941), fur- 
ther refined measure theory by combining the ideas of Camille 
Jordan on finite contents with the countably additive measure notion 
of Emile Borel. Lebesgue called sets in R” measurable whenever 
their inner and outer measures are equal. This led to the completion 
of the concept of measure and gave rise to the general theory of 
integration so significantly enlarging the class of integrable functions 
that it made Lebesgue say in 1902: “I know no function that is not 
summable and I do not know if any such exists.” (However, the 
Italian Giuseppe Vitali showed the existence of such a function in 
1905.) Lebesgue also established several central theorems in the 
theory of integration; one of them is the famous Lebesgue dominated 
convergence theorem. 


Finally, the Austrian Johann Radon (1887-1956), in his 1913 
habilitation work began to study abstract measures and integrals 
more general than those of Lebesgue. Radon is also the author of the 
well-known Radon and Radon - Stieltjes integrals. The latter is most 
frequently referred to as the Lebesgue - Stieltjes integral. Radon's 
ideas led not only to the abstract theory of measure and integration 
but also to its applications in the boundary value problems in the 
theory of logarithmic potentials (developed by Radon himself) and 
contemporary theory of probability and stochastic processes. 
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1.1 Definitions. 


(i) Let ©’ be a system of subsets of 2 including the empty set 
@. A numerical function [3’, R, ju] such that j4(@) = 0 is called a set 
function. In this chapter we only consider nonnegative set functions 
pid > RR. 


In the definitions below we assume that corresponding sets are 
elements of >’. 


(ii) A set function su is called finitely additive or just additive if 
for any n-tuple A,,...,A, of mutually disjoint sets with }>7_, A, 
ex. 


1 ( trAs) = pa HAs). 


(iit) A set function py is called o-additive if for any sequence 
Aj, Ao,... of mutually disjoint sets with S>** , A, € 3’, it holds that 


n=1 


a oS An) = re M(An)- 


(7v) A set function ~ on > is called continuous from below on 
+) if for every monotone nondecreasing sequence {A,,}{ such that 


L) A, € 2 it holds that 


7 jim. H(An) = u(U U An). 


If this condition is known to hold just on a particular monotone non- 
decreasing sequence {A,,} 7, then ju is said to be continuous from 
below on {A,}. 

(v) Let {A,} be a monotone nonincreasing sequence and 
() An € &’. A set function on »’ is said to be continuous from 
n=1 
above on {A,,} if 

Jim, w(An) = n{ A An): (1.1 


n=1 
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The set function jz is called continuous from above on 5 if equation 
(1.1) holds for every monotone nonincreasing sequence {A,,} | 
such that (] A, € 2’. In particular, if {A,,} | @, (1.1) reduces to 


n=1 
dim. [(An) =0 
and this is referred to as continuity from above at the empty set or, 
for short, O-continuity of 1. 


(vi) A set function su is called finite on »’ if (A) < oo for all 
Ae x. 


(viz) A set function p is called o-finite on »’ if 3’ contains a 
CO 
sequence {A,,} monotonically increasing to 2 (that is, LJ A, = Q) 


n=1 
and j1 is finite on {A,,}. In this case, we also say that ju is o-finite on 


{An}. 
(viit) An additive set function ju defined on a semi-ring S (in () 
is called an elementary content (on S). 


(iz) An additive set function y defined on a ring R (in 2) is 
called a content (on R). 


(x) A o-additive set function j: defined on a ring R or algebra 
A (in Q) is called a premeasure (on R or A). 


(xi) A o-additive set function jz defined on a o-algebra ¥’ (in 2) 
is called a measure (on %). If, in addition, w(Q) = 1, then pu is 
called a probability measure. 


(xii) Let (Q, X’) be a measurable space and let ,, be a measure on 
+7. Then the triple (Q, 2’, 1) is called a measure space. If fu is a 
probability measure, then the triple (Q, 2’, 1) is called a probability 
space. O 


1.2 Examples. 


(i) Let »’ be a o-algebra in Q and let a € 2 be a fixed point. 
Define the following set function ¢, on 4’. For each A € 2’, we set 
Eq(A) =1lifa eA and «,(A) =0 if a ¢ A. Then ¢, is a measure 
on ». It is clear that ¢,(@) = 0 and that e, > 0. Let {A,} be a 
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sequence of pairwise disjoint sets from »’. Then a can either belong 
to exactly one set, A; (for some 7), or to no set of the sequence. In 
the first case, 


Eq Oo aw = il 
and in the second case, 


ég Ou 14a) =O 
On the other hand, in the first case, 


yea An) = 1, 


and in the second case, 


> -1€a(An) = 0. 


Therefore, €, is o-additive. The measure €, is called a point mass or 
Dirac measure. 


(i7) Let 2 =R”. In Example 1.2(v), Chapter 4, we introduced 
the system S of all half-open intervals (a, b]} C IR", which was shown 
to be a semi-ring on R”. For a = (aj,...,@,) and b = (b1,..., bn) 
(a; < bj), we define 


d((a, b}) =T] (be — an) and °(@) = 0. 


k=1 


Then \° is obviously an elementary content on S. ° is said to be the 
Lebesgue elementary content. 


(ii) Let [R, R, f] be a bounded, monotone nondecreasing, right- 
continuous function that vanishes at —oo. Any such function is 
called a distribution function. There is also a variant of the so-called 
extended distribution function [IR,R, f], which is monotone nonde- 
creasing, right-continuous, but need not be bounded over unbounded 
sets and does not vanish at — oo. (As a right-continuous function, an 
extended distribution function is bounded over bounded sets though.) 
Because f is monotone nondecreasing and right continuous, for any 
two real numbers a < b, the set f*((a,b]) is Borel (show it; Problem 
1.15). In other words, we have f**(S) C B(R), where S is the semi- 
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ring of left half-open intervals. Thus, by Proposition 3.4, Chapter 4 
[see also Example 3.5 (iz), Chapter 4], f is Borel. 


So, every distribution or extended distribution function is Borel. 
Both types of functions are subclasses of monotone functions (which 
are introduced and studied in forthcoming chapters). 


Let S be the semi-ring of half-open intervals in R. Given a 
distribution function f, we define the set function ty on S as 


p(O) =0 and ph ((a,b]) = f(b) — f(a). 


Ly is clearly additive on S and therefore is an elementary content on 
S. 


Ly is called the Lebesgue - Stieltjes elementary content. 


(iv) Let Q be an uncountable set and let 
= {Ae P(Q): either A or A° < N}, 


which is a o-algebra on 2 [see Example 1.2(vii), Chapter 4]. Then, 
define VA € %, 


0, AxN 
wad dean 
We show that jz is a measure on (2. First observe that p> 0 and 
p(O) =0. Let {A,:n=1,2,...} be a sequence of pairwise 
disjoint sets of SX’. If the union U°°, A, is at most countable, then 
each A,, is at most countable, and thus 


bh bey Ay] =0= PO L(An) = pa 


i A,]° is at most countable, then we argue that there is 
exactly one set of the sequence {A,,} with at most a countable 
complement. Suppose that there is yet another set from this sequence 
with this property. Then both of them, say A and A‘, are at most 
countable and hence A‘, Aj is also at most countable. In as much 
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as Aj;M Ay, = @, ASU A, =, which is a contradiction, for 2 is 
uncountable. Therefore, we have exactly one set A; such that A® is at 
most countable. Then, 


Soa Aud! = 74s 


is at most countable and 


Le oe A,| =1 
On the other hand, 


Yone1 H(An) = (Ar) + +++ + w(Ai-1) 


0 
+ w(Ai) + (Ain) +e = 1. 


1 


This yields c-additivity of ju. 


(v) Let {0,27,y;:i=1,2,...} be a sequence of measure 
spaces and let {a;:i=1,2,...} be a nonnegative numerical 
sequence. Define jz on >’ as 


i = > 1 Onlin (1.2) 


Then js is a measure on 3’. (See Problem 1.4.) 


(vi) Consider the special case in the last example with ju, = €,,, 
n=1,2,..., where {b,} CQ. In other words, ju, is a point mass 
that was introduced in (i). Then the measure ju defined in (1.2) is 
called an atomic (or discrete) measure. A further special case is of 
interest. With the given j1,, we also assume that the sequence {a,,} 
has the property 


yin=1On = 1. 


It is readily seen that the measure ju: is a probability measure; more 
specifically, it is an atomic (or discrete) probability measure. 
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(vit) Let Q be an uncountable set and let = {A € P(Q): 
either A or A° < N} as in (tv). Define VA € 1’, (A) = Oif A is at 
most countable and ju(A) = oo if A° is at most countable. Then pu is 
a measure on 2’. 


(vizz) Let 2 be an arbitrary set and »’ be a o-algebra on (2. Define 
the following set function yw on S’. For each A € XY’, p(A) = |A| 
(that is, the number of elements in A), if A is finite and j(A) = on, 
otherwise. Then, it is easy to verify that j: is a measure on 2’. It is 
called a counting measure and (Q, 2’, ju) is referred to as a counting 
measure space. Note that if (2 is at most a countable set, a counting 
measure jz can also be expressed in terms of the measure introduced 
in (1.2), with a,, = 1 and pi, = &,. O 


1.3 Proposition. Let ,1 be an elementary content on a semi-ring 
S. Then the following hold true. 


(i) For any two sets A and B from S with AC B, wWA)< 
[(B) (monotonicity). 


(ti) Let Ay, Ao,... be a sequence of pairwise disjoint sets from 
Sand A, C A € S for each n. Then, 


yr MAn) < w(A). (1.3) 
(iti) Let A, Ay, Ao,... € S and AC\ An. Then there is a 


n=1 


countable decomposition )>°_,C;, of A with C;,'s © S such that 
dopet(Ce) S DOP eAn). (1.3a) 


Proof. 


(@) B=A+B\A=A+)0_,C,, where C,'s € S. Hence, by 
additivity of ju, 


u(B) = (A) + D751 (Cs) 


and the statement follows. 
(72) By Problem 1.12, Chapter 4, 
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A\><"_, Ar = 32™.,C., where C, € S. 
Hence, because of the assumption that A,, C A, 
A=yv Ay oC, 
Thus, by additivity of on S, 
pA) = hy W(An) + OX WC), 
which implies that 
L(A) > dof, (Ax), good for all n. 


Consequently, 
pA) > Yo H(Ar). 


With B, = A, M A (€ S) we have that J B, = A. Denote 


n=1 


dD, = Bi, Dyes = Baa) U Ba n= L,2; see 
j=l 


Then, A = )°*°., Dy and, by Problem 1.12, Chapter 4, 
Dy = Cue with Cy's € S. 
Because D; C By, by (12), 
MWC.) < (By) for each k. 
Now, due to monotonicity of j. and because B; C A;,, we have that 
CRD eCse) S$ DRMBw) SCE HMAR), (1.36) 


which yields (1.3a). O 


270 CHAPTER 5. MEASURES 


1.4 Corollaries. 


(7) If yo 4 is a decomposition of A by elements E,,)'s 
from 8S, then [from Proposition 1.3(222)]: 


yo tCe) = pede 5 | (1.4) 
ie., 02, C;, is a “u-minimal decomposition” of A. 


(it) Let A, Ay, Ao,...€ Sand AC VU An. If pu is o-additive 
n=1 
on S, that is, for any sequence of mutually disjoint sets {A,} CS 
with O°, An € S, 


Li Ae) = yo tMAn), 
then (from (1.4)): 
HAYS Ye la) <= A. (1.4a) 


In particular, if A= (J An € S, 


n=1 


u( U Ay) S Via H(An)s (1.4b) 


which is known as the o-subadditivity property. 


Inequality (1.4b) (originally applied to a semi-ring S and elemen- 
tary content /1) implies: 


(ii7) If wis a content on aring R then for any Aj,...,An € R, 


mee U Ay) < Sof, MAz) (finite subadditivity). 
k=1 


(iv) If wis a measure on ao-algebra ») then the o-subadditivity 
property (1.4b) is valid for any sequence {A,,} C &. 


1. Set Functions 271 


(v) Because of the monotonicity property of an elementary con- 
tent, due to Proposition 1.3(i), Definition (1.1(iv)) of finiteness can 
be relaxed for set functions from elementary contents and above” 
by requiring merely ,(Q) < oo. O 


There are two more minor properties of contents left as an 
exercise. 


1.5 Proposition. Every content 4s on a ring R has the following 
properties. 


ee AC B= p(B\A) = w(B) — p(A) [provided that (A) 
< OO}. 


(it) p(AUB) = (A) + p(B) — w(AN B). O 

(See Problem 1.2.) 

1.6 Lemma. Let ju be a content on a ring R. Then ju is a premea- 
sure (i.e., o-additive) on R if and only if js is continuous from 
below. 

Proof. 

1) Let 2 be a premeasure on FR and {A,,} a monotone non- 


decreasing sequence in R such that A: = LJ) A, € R. If f(A) = co 
n=1 

for some n, by monotonicity, ju(A;) = oo for all k > n and y(A) = 

oo. Then continuity from below follows immediately. Assume that 

t(An) < oo for all n. Denote 


By = An\An-1 ; n= 1, 2 a) Ag =@. 


Then 
nr 
n— k=1 B k 


and { B,,} forms a pairwise disjoint sequence from R with 


1 Ba HA, 
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Because pi(A,) < co (by the above assumption), j(B,,) is well 
defined and, therefore, due to c-additivity of ~ (as a premeasure), 


jim, H(An) = lim, her H(Ba) = pe HBr) 
= # Oop Be) = vA), 


which yields continuity from below. 


2) Now let jz be a content on R, which is continuous from below, 
that is, for every monotone increasing sequence of sets in R, {A,,} 7 


A: = \J A, such that A € R, it holds that 
n=1 


Let {C;,} be a sequence of pairwise disjoint sets. By setting 


n 
By, — k=1 Ck, 


we get {B,}7 CR, with U B, = B and hence Jim. yu(Bn) 


= p(B), which is equivalent to 
dims Hora Ce) = fim pan H(Ce) = MOQ Cn). 


This is the desired o-additivity of pu. O 
1.7 Theorem. 
(i) If is a premeasure on a ring R and if {A,}| CR such 
that (A) < co and A: = Q A, € R, then js is O-continuous (or 


n=1 
continuous from above) on sequence {A,,}. In particular, if 1 is a 
finite premeasure on ring R then js is O-continuous (or continuous 


from above) on R. 


(77) If is a finite content on a ring R then ©-continuity implies 
that | is a premeasure. 


1. Set Functions 273 


Proof. 
(i). Because {A,,} is monotone nonincreasing, {A;\A,}? C 


R. It can easily be shown that (J (A;\A,) = AN\( ‘a An). Now 
n=1 n=1 


we apply Lemma 1.6 to the sequence { A;\A,,} to arrive at 
Jim,. w(4s\4n) = n( U(4s\An)) = H(r\a). (7 
n=1 


In as much as yu is finite on {A,,}, by Proposition 1.5(2), equation 
(1.7) reduces to 


(Ay) — fim. W(An) = w(A1) — H(A) 


and thereby yields assertion (7). 


(ii) We show that @-continuity implies Lemma 1.6 and that 
Lemma 1.6 in turn yields (iz). Let {A,} 7 CR such that {A,,}7 
AER. Then {A\A,,}| @ and @-continuity yield 


jim. (A\An) = 4(@) = 0. 


Because the content jz is assumed to be finite on R, by Proposition 
1.5 (2), the last expression leads to 


lim |, H(An) = bCA). 


nm —+ CO 


Applying Lemma 1.6 (2) we have that ju is a premeasure on 7. O 
1.8 Examples. 


(i) We show that on some measure spaces, while {A,,} | @, 
wWA,) % 0. Let (Q,4,u) be such that OQ=N, Y= PQ), 
w({n}) = +. Let A, = {n,n + 1,...}. Clearly, {A,} is a monotone 
decreasing vanishing sequence. However, ju(A;,) = co for each n 
and thus, p(A,) — oo. So, G-continuity does not apply, because we 
violated the condition ~(A;) < oo of Theorem 1.7, which, as we see, 
is essential. 
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(it) Consider in 2 the o-algebra ©’ = {0, @, A, A°} and define 


per de —= {1,0;p,1 =p}, 


where p € (0,1). Then y is a (probability) measure on 5’, called a 
Bernoulli measure. Notice that for the traditional Bernoulli measure 
Q = {0,1} and A = {1}. 


(iit) Consider the following atomic measure on the Borel o- 
algebra B(R): 


n 


j= dl anes , 
k=0 


where 
ay = (x) pr(l — py*, pE (0, 1); k= iets n. 


Clearly, jz is a probability measure. It is called the binomial measure 
(with parameters (n, p)) and it is denoted by /3,,,y. 


(iv) Consider another atomic measure on B(R): 


jp= >> 9 Gren, Wheted, =e ? S, a > 0, n=0, Lax. 
jt is also a probability measure, called the Poisson measure (with 
parameter q), in notation, 7,. i 


1.9 Notation. Denote by 9(Q, »’) the set of all measures on a 
measurable space (2, 5’). According to Problem 1.4 [Example 1.2 
(iv)], any positive linear combination a4 41 + a2/12 of measures (that 
is, such that a1, @2 € R,) is a measure. With a = 0, we have ay = 
?, being zero measure if we agree 0 - co = 0, which is commonly 
agreed upon in measure theory. So, with 0 € IN(Q, B’), Mt(O, 2’) is 
a semilinear space over the semifield R,, in notation (Mt(Q, 2’), 
R,, +,.-). We show that this space of measures is also a lattice, 
under a particular lattice operation. O 


1.10 Lemma. Let pi), W2 € W(Q, B’). Then the set function 
p(A) = sup{ii(E) + p2(A\E): Be YN A} 


is a measure on S/. 
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Proof. Obviously, p is a nonnegative set function defined on >’ 
that equals zero at the empty set. 


Next we show that p is monotone. Let E C A C B. Then, we 
have 


[i (EB) + p2(A\E) < pn (E) + po(B\E). 


Because the inequality holds true for all F C A and E C B, p(A) 
< p(B). 


We need to prove that p is o-additive. Let {A,} CX be a 
sequence of disjoint sets with A: = °° ,A,. We will first show 
that p is o-subadditive on {A,,}, that is, 


p(A) < 2% 1p(An). (1.10) 
Let £ be a measurable subset of A. Then, 
[1 (E) + p2(A\F) 
= (dina (An 9 E)) + odin (An 9 E*)) 
= Vine (An NB) + pa(An\(An 1 £)) 
S ene P(An) 
which yields (1.10). 


It remains to show the inverse inequality. To avoid a trivial case, 
we assume that p(A) < oo. Then, by monotonicity, p(A,) < p(A) 
< oo. From the definition of p, given A,, and €, > 0, there is a 
subset E,, C A,, such that 


P(An) < pa(En) + H2(An\En) + én, n = 1,2,.... 


We choose e,,'s such that a < ce, which we choose before- 
hand. Denote EF = pad os Obviously, F,,'s are disjoint and 
EC Aand 


A\E _ ret Ane) = done (An\En); 
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because the rest of E;'s are disjoint with A,,. Therefore, with 


[1 (E) Tr [2(A\E) — En) a pia An\ Ea) 


we have: 


YenwiP(An) S yeaa (En) + Da H2(An\ En) + VperEn 
< ju(E) + po(A\B) +e < p(A) +e. 


This inequality holds for each ¢ > 0. Therefore, we arrive at the 
targeted inverse inequality and we are done with the lemma. O 
1.11 Lemma. Let 11, 2 € IN(Q, X’). Then the set function 
v(A) = inf{yus(E) + jn(A\E) : B€ EN A} 
is a measure on 3). 


The proof of this lemma is similar to that of Lemma 1.12 and is 


left for the reader (Problem 1.16). O 
1.12 Notation. We introduce on 9(Q, 2’) the partial order =< 
such that yw < v if and only if (A) < (A) forall A € &. O 


The following theorem states that (IN(Q, X’), < ) is a lattice. 


1.13 Theorem. Let p and v be defined as in Lemmas 1.10 and 
1.11. Then, p = py V pl. and v = [41 A 2, and consequently, the 
partially ordered set (IN(Q, ’), X ) is a lattice. 


Proof. (i) Let U({p1, ue}) C MQ, X’) denote the set of all 
upper bounds of the two-element set {j11, 42}. We show that 
p € U({p1, v2}). In other words, we show that for each A € 4, 
p(A) > pi(A) and p(A) > po(A), in notation, p > jy and p > po. 
Indeed, for any measurable subset E' C A, 


p(A) > wi(E) + pe(A\F). 


Choosing EF = A or E=@, we arrive at p = {ji1, 2} and thus 
p € U({p1, H2}). Similarly, it can be shown that the measure v from 
Lemma 1.11 belongs to the set £({j11, u2}) of all lower bounds of 
{{41, 42}, in notation, v X {j11, 2}. 
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(i2) Let wu € U({p1, w2}) be another upper bound of the set 
{[1, 12}. We show that p < u, which validates that p is the smallest 
upper bound of {j11, j42}. Let FH C A be two measurable sets. Then, 
we have 


p(B) + p2(A\E) < u(E) + u(A\E) = u(A) 
implying that p X wu and that p = jay V lo. 
Now, let \ € L({p41, 42}). Similarly, 
\(A) = ACE) + MA\E) S pa (E) + p2(A\E) (1.13) 


holding for all EF C A. Therefore, the inequality will sustain for the 
infimum on the right of (1.13). Consequently, \ x v and thus v is 


the largest lower bound of {/11, ju2}. O 

1.14 Remark. In light of Definition 7.9(ii), Chapter 1, 
(N(Q, X'), Ri, +,-, X ) isasemivector lattice. O 
PROBLEMS 


1.1. Let S be the semi-ring of half open intervals on the real line 
and \” be the Lebesgue elementary content. Take A = (0, 1], 
B= (1,2], and C = (3, 4]. Although \°(A + B) = A°(A) + 
\°(B), we cannot state that \9(A + C) = A9(A) + AC), 
because A+ C is not an interval, and therefore, the left-hand 
side of the last equation is not defined. Hence, ° is not 
additive on S. True of false? 


1.2 Prove Proposition 1.5(i, ii). Every content i on a ring R has 
the following properties: 


(i) ACB=>p(B\A) = u(B)— (A) — [provided _ that 
p(A) < oo]. 


(it) (AUB) = pA) + w(B) — w(AN B). 
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1.4 


1.5 


1.6 


Py 


1.8 


1.9 


CHAPTER 5. MEASURES 


Show that for a content on a ring, the notions of continuity 
from below and @-continuity are equivalent. 


Prove that jz is a measure on »7 in Example 1.2(v). 


Let (2,27,44) be a measure space and let {A,: 

n=1,2,...}CX with SO, u(A,) < oo. Show _ that 

me lim An) = 0 [Hint: Apply continuity from above of mea- 
love) 

sure 4 to the monotone nonincreasing sequence { (J Age 


k=n 


A subset R,; of a ring R in 2 is called an ideal in R if it has 
the following properties: 


a) OE Ry. 
b)BER, ACB, AER => AER. 
c) A, BER, > AUBER,. 


Let js be a content on a ring R. Define R, ={RER: 
u(R) = 0}. Show that R,, is an ideal in R. 


A subset R,.; of a ring FR is called a o-ideal in R if, in 
addition to properties a-c) of Problem 1.6, it is closed with 
respect to the formation of at most countable unions. Let ju be 
a premeasure on R. With the same definition of R,, as in 
Problem 1.6, prove that R,, is a g-ideal in R. 


Let R,.; be a c-ideal in a ring ®. Show that there exists a 
premeasure 44 on FR such that R,.7=R,, defined in 
Problems 1.7 and 1.8. 


Let yu be a finite content on a ring R. Show that d(A, B): = 
(A A B), forall A, B € R, is a pseudo-metric on F [that is, 
that d possesses all properties of a metric except for 
d(A, B) = Oyields that A = B). 
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1.10 


1.12 


1.13 


1.14 


1.15 


1.16 


1.17 


Let (, X’,44) be a measure space and {A,}C 2X be a 
sequence of sets with su (0 A) <oo and inf{p(A,): 


n=1 


(R=12 on} HaS o, Show that ;( lim An) ths 


n + CO 


Let (Q,7,~) be a measure space and for a number 
0<a<o, define G, = {G € ©: u(G) < a} and 


EL ={(OCMONGED, VEG). 


Show that »”, is a o-algebra. 


Under the condition of Problem 1.11, let a = oo and yu be a 
finite measure. Show that X’},, = »’. 


Let (0, 27,1) be a measure space, G. = {G€ X:u(G) 
< oo}, and Yo ={QCN:QNG € Y,VG €G}. Define 
the set function j4., on 37, as 


_ J oQ), QE 


(Notice that 1’ C 37.) Show that jz. is a measure on 27. 


Argue that for any probability space (Q, 5’, jz), the axiom ju(@) 
= 0 is redundant. Is it also true for any measure? 


Show that for each two real numbers a <b, f*((a,6]) in 
Example 1.2 (iii) is a Borel set. 


Prove Lemma 1.11. Let fu, fg € MQ, X’). Then the set 
function 


v(A) = inf{~i(£) + wo(A\E): Be YA} 
is a measure on J). 


Prove the following statement. Let {ju,} C IN(Q, X’) be a 
monotone nondecreasing sequence of measures in the sense of 
<x. Then the pointwise limit of the sequence tw: = 
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limpsofn (that is, p(A) = limpofn(A),A€ X), is a 
measure on 5} and ws = sup{ fu, }. 


Let (Q, 2’, P) be a probability space and let G C »’ be a family 
of events. G is called independent if for any finite tuple of 


events G1,...,Gm; (fn Gs) = |] P(G;). Let D(G) be the 
i=1 i=1 


Dynkin system induced by G. Show that D(G) is also indepen- 
dent. 


Under the condition of Problem 1.18, let G be a semi-ring. 
Then show that 47(G), the o-algebra induced by G, is indepen- 
dent. 


Let (Q, 2’, P) be a probability space and let {G,...,G,} bea 
set of n families of events. This set is said to be independent if 
for any choice of events G, € Gi,...,Gn € Gn, the tuple {Gi, 
...,Gp} is independent. If the set {G,...,G,,} is independent, 
show that the set {D(G,),..., D(G,)} (of Dynkin systems in- 
duced by G;'s) is also independent. 


Let (Q, 2’, P) be a probability space and let {G; : i € I} be an 
indexed system of families of events. This system is said to be 
independent if for whatever choice of n indices, 21,...,%, € I 
and then a choice of n events G; € G;,,...,Gn € G;,, the tuple 
{G,,...,Gn} is independent. If the set {G;:i¢ I} is 
independent, show that the set {D(G;):i€ I} (of Dynkin 
systems induced by G;''s) is also independent. 


Under the condition of Problem 1.20, suppose that G;'s are 
semi-rings. Show that the set {2’(G1),..., 7(Gn)} is indepen- 
dent. 
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2. EXTENSION OF SET FUNCTIONS TO A MEASURE 


We begin this section with the introduction of a set function that is 
not exactly a measure, as it is not even additive, but which is at the 
heart of the formation of measures extended from some more primi- 
tive set functions. A prominent example of such a construction yields 
the Lebesgue measure. It is firstly defined on “rectangles” and rect- 
angular figures. Then the measurement of a more general set is ac- 
complished by an approximation of the set by rectangles that cover 
the set. The latter leads to the notion of an “outer measure,” which 
was initially proposed by Lebesgue at the turn of the last century and 
later on refined by Carathéodory. Carathéodory's approach is 
essentially preserved in the contemporary constructions. 


In the four steps below we outline the Carathéodory extension 
procedure. 


1. An Outer Measure. We first introduce an outer measure |4* on 
P(Q) that is a monotone set function(A C B => p*(A) < p*(B)) 
and o-subadditive: 


{Qn} £ PM) > w'( Un) < Dew 

Secondly, we identify the family of jpu*-measurable sets 
+/* C P(Q) that exhibit some sort of additivity property. Namely, 
for an M € &”* (because M separates nyQ COQ: Q=QNM+ 
QM M°) we have p"(Q) = w(QN M) + w(QN M*). 


It turns out that X’* is a o-algebra in (2 and ju) = Resy-* is a 
measure on »%'*. Moreover, this measure is complete. The latter is 
often a valuable property on its own (utilized in many applications); 
namely, if (Vv) = 0 for some »’*-measurable set (null set), then 
P(N) © &X*, which in general, need not be measurable. 


2. The Construction of an Outer Measure by a Formatter. Let 
G be a fixed family of subsets of 2 identified as rectangles (by the 
analogue with rectangles in R” and their role in measuring sets) and 
y be a set function on G. For any subset Q C 2), we define the set 
& = €(G) of all at most countable covers of @ by rectangles. 
Then, it turns out that the set function p* on P(Q) defined as 
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coy _ J inf{ 2 WGn): {Gn} € B}, BED 
u"(Q) = { = : & -O 


is an outer measure. We call the pair (G,y) a formatter of the outer 
measure ju”. 


3. The Extension of Formatter (G,) to (X'(G),u}). A 
reasonable way to enlarge a formatter (from a measure-theoretical 
point of view) is to attain the more populous ’(G) and then have 
some measure on 4/(G), say ju, extended from y (that is, such that 
Resgjt = y). A desirable extension must clearly be unique. In light 
of Steps 1 and 2, accomplishing it in a plausible way suggests the 
formation of the outer measure ju* followed by the reduction of P(Q) 
to X’* on which ju* is a measure. 


But does this process leads to 4’(G) (which seems to be a smaller 
o-algebra containing G)? Unfortunately »’* is unrelated to »7(G), 
because G ¢ &*, and furthermore, p.* # y on G, unless we make 
some assumptions about G and y. If such conditions exist, we call 
formatter (G, y) extendible (through the outer measure it produces). 
If the formatter is extendible, then ('*, 5) is called the complete 
Carathéodory extension of (G,7) and its restriction to (3'(G), 49) 1s 
called a Carathéodory extension. 


4. The “Extendibility” Conditions for Formatter (G, 7). If the 
formatter (G,y) carries a semi-ring and elementary content, respecti- 
vely, then G C &™. If y is o-additive on G, then (X*, 45) becomes 
the complete Carathéodory extension of (G,). Consequently, the 
restriction from (2, j45) to (X'(G), 445) is a Carathéodory extension 
of the formatter. In other words, there is a measure on 5'(G) that 
coincides with y on G. Finally, if not only is y o-additive but also o- 
finite, the Carathéodory extension is unique. O 


We now turn to the details of the above procedure followed by a 
discussion of various formatters. 


2.1 Definition. Let (2 be a nonempty set and ju* be a set function 
defined on P((2). ju* is called an outer measure if: 
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a) *(O) = 0 (a reminder). (2.1a) 

|ACB => p*(A) < w*(B) (monotonicity). (2.1b) 
Qn} £ PM) > 1°( U Qn 

< SO u*(Qn) (o-subadditivity). (2.1c) 

O 


2.2 Definition. Let y.* be an outer measure on P((2). A subset 
M C (11s said to be pz*-measurable, if for any Q C Q, 


wQ) = wQAM) + wQN M*). (2.2) 


We also say that M u*-separates Q. O 


The following is what essentially constitutes the widely referred 
to Carathéodory extension theorem that we outlined above. For con- 
venience, we break it up into several theorems. The idea of an outer 
measure, along with the extension, belongs to the German mathema- 
tician (of the Greek origin) Constantin Carathéodory that appeared in 
his 1914 paper, “Uber das lineare Mass von Punktmengen - eine 
Verallgemeinerung des Langebegriffes” (in Gottingen Nachrichten) 
and in his widely referred-to 1918 book, Vorlesungen uber Reellen 
Funktionen (Teubner, Leipzig). We start with the following. 


2.3 Theorem. The collection &’* of all y*-measurable subsets is 
a o-algebra in Q. The restriction of p* from P(Q) to &*, in 
notation |4, is a measure. 


Proof. Because throughout the proof of this theorem we largely 
use equation (2.2) or prove its validity, we first notice that, due to o- 
subadditivity of j.*, as an outer measure, the inequality 


wQ)<SwQNM)+ wQ0M*) (2.3) 


holds true for all subsets, Q and M, of 2. 


Our proof includes the following steps. 
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a) { is obviously an element of »*, as it satisfies (2.2). If 
Me 2", then M° € X”*, by their symmetry in (2.2). 


b)We show that X’* is closed with respect to the formation of 
finite unions, that is, we show that with A, Be X*, AUBe d*. 


Because B € &", it follows that for each Q’ € P(Q), 
(QQ!) = #(Q'N B) + w(Q*N Be). (2.3a) 


Specifically, (2.3a) is valid for Q’=QNA and Q’=QNAY‘, 
Q € P(Q). Hence, 


w(QNA) = w(QN ANB) +p"(QN ANB’) 
and 


w(QNA®) = w(QN ANB) +p (QN ANB’). 


Summing up the last two equations and taking into account that 
Aé &™*, we have 


u(Q) = B(QN A) + w(QNA®) 
=u (QNANB)+uw(QN ANB) 
+ (QN ANB) + uwt(QN ACN B®), 
implying that 
u*(Q) = w(QN ANB) + uw(QN ANB’) 
+ pw (QN ASN B)+ w(QN(AU B)°). (2.3b) 
Now replacing Q in (2.3b) with QM (A U B) we also have 
u(QN (AU B)) 


=W(QN(AUB)NANB)+u(QN(AUB)NANB’) 
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+w(QN (AUB)N BNA) + u(QN(AUB)NASN B®) 

=p (QNANB)+W(QNANB’) 
+uw(QNBNA)+uw*(QNO). 

The latter reduces to 
uw(QN (AU B)) = #(QN ANB) 

+W(QNANB)+wV(QN BNA‘). (2.3c) 
Substituting (2.6) into (2.5) we get 


w(Q) = wIQN (AUB) +u"[QN (AUB), 


which shows that AU B € &”™. 
The above assertions a) and b) imply that &’* is an algebra in 12. 


c) Now we prove that * is a o-algebra in 1). Because »*, as an 
algebra, is (-stable, it is sufficient to show that ©’* is a Dynkin 
system. (See Problem 1.10 of Chapter 4.) 


Let {A,} C ©* be a sequence of disjoint sets. Take Aj, Ao 
€ {A,,}. Substituting A; = A and A, = B into (2.3c), taking A and 
B in (2.3c) disjoint, and then noticing that AM B® = A and BN A° 
= 5, we arrive at 


wIQN(A+ By =w(QNA) +u"(QNB). (2.3d) 


If A,,...,A, is an n-tuple of mutually disjoint elements of >”, 
then, by induction, from (2.3d), 


WIOMS,| = 3, ait (ON AR: (2.3e) 


where S,, = >\;-,Ax. Denote S = >-*°_, An. Because of S, C S, 
(QN S°) C(QN S*), and by monotonicity of 11*, 
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w(QNS*) < w*(QN Sz). (2.3f) 


Because >” is an algebra, it follows that S,, € X’* and hence it is pu*- 
measurable, 1.e., it j1*-separates Q, which, combined with (2.3e) and 
(2.3f), yields 


u*(Q) = u(Q Aq Sn) al w(Q M Si) 


nr 


> PAN At) + (QS), n= 1,25... 
c=1 


Therefore, 


w(Q) = Ve (QN Ag) + w(Q_NS*) (2.3g) 
that, by o-subadditivity, gives 
> MQ Yan) + (QNS") 
= 1"(QNS)+u"(QNS*). (2.3h) 
Inequalities (2.3) and (2.3g -2.3h) lead to 
w(Q) = w(QNS) + uw (QNS*) 
concluding that S = 5° ,A,, indeed p*-separates any Q C P(Q) 


and thus is an element of ’*. The latter supports the claim that X”* is 
a Dynkin system and, consequently, that ©’* is a o-algebra. 


d) We show that juj is a measure on &’*. Substituting the set 
S = >>>, An for Q in (2.3g), we have 


Ho (dont An) 2 Dope ta( Ar)» 


which, due to o-subadditivity of ju*, leads to the strict equality and 
thereby, o-additivity of jj. Therefore, we have proved that Resy-p*, 
denoted by jj, is a measure. 
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The proof is, therefore, completed. O 
2.4 Examples. 


@) Let 
O= 16.00), A={a}, A* = {b,c}, 


P=1b}, Q= {ec}, AR=4a, 6}, 8 =a; c}. 


Define the following set function p* on P(Q). 


One can easily verify that j.* is an outer measure on 
P(Q) = {O, Q, A, At i, a, i, S}, 


as it satisfies axioms (2.la-2.1c), but ju* is not a measure, because it 
is not additive. We can see that only the sets 0, 2, A, and A° p:*- 
separate all subsets of 22 and, consequently, {@,0, A, A°} is the o- 
algebra ’*. Clearly, uj, as the restriction of j.* on &*, is a measure. 


(it) Let Q be an infinite set. Define the set function y on P((Q) 
by 7(Q) = 0 if Q is a finite set and 7(Q) = 1 if Q is infinite. Let 
Q = {{w,}, n =1,2,...} be a sequence of all different singletons. 
Then, 


dina V({wn}) = 9, 


and 7(Q) = 1. Thus, y is not o-subadditive and not an outer meas- 
ure. CO 


Recall that a restriction of a function [X,Y, f] is a function 
[Xo, Yo, fo] defined on contracted domain X 9 C X with f = fo on 
Xo and Yo C Y. (In notation, fo = Resx, f.) From Theorem 2.3, we 
learned that the set function [X”*, [0, co], ju9] 1s a restriction of outer 
measure [P(Q), [0, oo], u*]. 


290 CHAPTER 5. MEASURES 


If X and Y are supersets of X and Y, respectively, a function 
[X,Y, f] is called an extension of f (from X to X), if [X,Y, f] is 
the restriction of f to X. (In notation, f = Extyf.) We will apply 
this notion to extend a set function y defined on a collection G of 
subsets of Q to a set function 7 on an expanded family €(G) of 
subsets of 9. For instance, in Example 1.2(ii) we defined the 
Lebesgue elementary content \° on the semi-ring S of half-open 
intervals in R". We can extend the Lebesgue elementary content \° 
to a (unique) content ,. on R(S) (see Problem 2.2), which turns out 
to be a premeasure on FR (verified in Theorem 3.1). The chief 
objective in this section is to construct an extension of a set function, 
such as premeasure, given on a ring, to a measure on the smallest o- 
algebra generated by this ring. Other matters, such as “completion” 
of a measure, will also be a focus of our discussions. 


2.5 Definitions. 


(i) Let (Q, 2’, 4) be a measure space. A measurable set N € 3 
is called a ju-null set (or just null set) if u(N) = 0. We denote the set 
of all p-null sets by NV. A set E is called ji-negligible (or just 
negligible), if there is a measurable null superset of E’. (Note that a 
negligible set need not be measurable.) The measure space is called 
complete, if for each null set N € N,, P(N) C X, that is, if all 
negligible sets are measurable. 


(ii) Consider a measure space (Q,X7,4). Let %' be the 
collection of all sets of type AU M where Ac » and M is any 


negligible set. According to Problem 2.8, »’ is a o-algebra. We 
extend ju to Zi on » by setting 


fi(AU M) = B(A) = w(A). 


According to Problem 2.8, the extension (5’, 71) of (27, jz) (or just 72) 
is complete. Measure ji is then called the completion of measure | 


and the measure space (2, »’, 7) is called the completion of measure 
space (Q, 3’, [1). O 


2.6 Example. Let Q=R, YX’ ={AeP(R): either A or 
A® x N}, which is a o-algebra on 2 [see Example 1.2 (vit), 
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Chapter 4], and let c, be the point mass. Both sets A = {+, n= I, 
2,...} and A® are elements of X’ and ¢,(A°)=0. Obviously, 
E =[2,c0), as a subset of A°, is negligible, but not measurable. 
Therefore, the measure space (IR, >’, €,) is not complete. (See a more 
general case in Problem 2.14.) O 


The proposition below is a paradigm of a complete measure 
space. 


2.7 Proposition. The restriction [13 of an outer measure |1* to the 
o-algebra &”* of all «*-measurable subsets of Q is complete and 
(Q, ©”, 45) is a complete measure space. 


Proof. Because yu* is defined on whole P(Q), for any pu*- 
negligible subset N C Q, due to (2.1b), .*CV) = 0 and, therefore, it 
is sufficient to show that N is ~*-measurable. Let Q C 2. Due to 
monotonicity of the outer measure, w*(Q MN) = 0 and w*(QN N°) 

< y*(Q) and this, along with (2.3), yields 


MQ) <wQON)+wWQANY) < w*(Q) 


and, hence, that N € »*. i 


The following is a construction of an outer measure by an arbitra- 
ry set function y defined on an arbitrary subcollection of sets 
G C P(Q). As usual, we only assume that G contains the empty set 
and that +, as a set function, is such that 7(@) = 0. This construction 
lies in the basis of the Carathéodory extension of the set function + 
to a measure on o-algebra 3/(G). 


For any subset Q C (2, denote by €Q(G) the collection of all at 
most countable covers of set @ by elements of G. [Unless there is 
another subcollection, besides G, under consideration, we will for 
brevity drop G in €(G).] Therefore, if 4% #@, for any {G,,} € 


Z, we have Q C UG». 


n=1 


2.8 Proposition. The set function j1* defined on P(Q) as 
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eo J inf(o71(Gn): {Gn} © @}, BAD 
w= {a BuO (2.8) 
is an outer measure. 


Proof. We need to verify the above properties (2.1a-2.1c) of pi* 
as an outer measure: 


a) Because @ € G and 7(@) = 0, it follows that .*(@) = 0. 


b) We assume that both p*(A) and *(B) are finite, because 
otherwise, the proof is obvious. If AC B, @%C & and then we 
can reach on & a possibly smaller limit inferior than that on @. 
Therefore, 


pi(A) < inf{ OG): {Gu} € GB} = p(B). 


ce) Let {Q,) 6 P(Q) and O =|) Ons If for-at least one nm, 
n=1 
€,, = O, then also @ = © and subadditivity immediately follows. 
We assume that for all n, 2, # @ and choose an e > 0. From 


[u*(Q);) — inf{ 1 (Gin): {Gin} € Z,}, 


and by the definition of a limit inferior, it follows that for €2~', there 
is acover {G;,, 2 =1,2,...} € @, such that 


wO)s yy Gw <P OQj+22~. 


Now, clearly {G;;,, i,n = 1,2,...} € @&. Thus, 


H(Q) S e  WGin) S VE (Qi) +, 


which proves monotonicity. O 


We call the couple (G, ) [that is, a subset G of P(Q) and a set 
function y on G] a formatter of outer measure ,1* defined by (2.8). 
As has been shown, the formatter and, subsequently, the outer 
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measure, induced the o-algebra ¥’*, on which ju* was a complete 
measure. 


When constructing a measure space (Q, &*, uj) by (G,7), the 
major goal is to extend y from G to a measure, say ju, acting on the 
smallest o-algebra ’(G) generated by G. This can be achieved by 
restricting (X*, j15) to (XG), 4), given that (2, j15) itself is an ex- 
tension of (G,7). The latter, however, is not guaranteed from the 
above construction unless we impose some restrictions upon the 
formatter (G, y), for even though (G, 7) produces (Q, »*, 115), (G, 7) 
need not have all elements j.*-measurable. In other words, G need 
not be a subset of X'*. In addition, j1j need not coincide with y on G. 
For example, if 7 is an elementary content and G is a semi-ring, then, 
according to Problem 2.2, for each G € G, there is a cover {C’,} of 
G such that U°° | C;, is a decomposition of G' and 


1 (G) = Vina (Cn). 


Hence, in order that .*(G) = 7(G), y must be o-additive on G, 
which, in general, is not. 


Consequently, we call (*, 445) [generated by (G, y) in (2.8)] a 
complete Carathéodory extension of (G,y) if G C &* and Resgpj 
=. If (£*, 5) is a Carathéodory extension of (G,), then the 
formatter (G,y) is said to be extendible and the corresponding 
restriction of (27*, 5) to (2G), j4) is referred to as a Caratheodory 
extension of (G,7). 


As mentioned above, one of the most important questions arises, 
as to what the formatter (G, y) should really be to be extendible and, 
consequently, generate a Carathéodory extension. By now, we have a 
fairly large choice of systems of sets and set functions on them, 
ranging from semi-rings to o-algebras and elementary contents to 
measures. The idea is, however, to select a possibly more rudimen- 
tary formatter (G,7), which is tame and suited in most common 
practical applications and constructions and such that (%”*, 5) is an 
extension of (G,¥). In particular, this means that the elements of G 
have to be y*-measurable. The following theorem, which is a crucial 
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step in the whole extension procedure, infers that (G, y) can be a ring 
and premeasure to serve as a reasonable and extendible formatter. 


2.9 Theorem. 


(i) Let (G,y) be a_ semi-ring and elementary content, 
respectively, in 91, which produce the outer measure p* and o- 
algebra %* of p*-measurable subsets of Q. Then G © X™. 


(ii) If, in addition, y is c-additive on G, then y = Resgi* and 
therefore (3*, j19) is an extension of (G, 7). 


In a nutshell, the formatter permits G to be as basic as a semi-ring, 
but the set function y must be o-additive. Now, if we extend G to the 
system that includes all finite unions of “rectangles” from G, by 
Theorem 2.6, Chapter 4, we arrive at the ring R(G) to which ¥ is 
readily and naturally extendible, considering the utility of Lemma 
2.5, Chapter 4 [according to which all figures of R(G) are unions of 
pairwise disjoint rectangles from G] and Proposition 2.11 below, 
surely preserving its o-additivity. Hence y~ becomes a premeasure on 


R(G). 
Proof. 


(2) We have to show that G C %"*, i.e., any element, G € G, l*- 
separates all subsets of 2. Take any subset Q CQ with 2 4 Q, 
because, otherwise, the proof would be trivial, and let C = {C’,} be 
any (countable) cover of @ from @. ForaG € G, and C;, € C, 


Cr =(ChnNQ+(C,NGD=(C,NG+(G\@. (29) 


Because G is a semi-ring, C!, 1G is an element of G and C,,\G can 
be represented as a finite union of pairwise disjoint elements of G, 
say yo Consequently, (2.9) can be rewritten as 


Cy = (Cy q G) a 5, 


rm 


and, by finite additivity of y, 


(Cn) = (Cn NG) + DHE HSj_)- (2.9a) 
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Now, suppose }>°°_,y(C;,) < oo. Then, summing up all equations in 
(2.9a) in n gives 


in Cn) = ya MGa NG) + Sn) «= (2.9) 
where {5,} is the reordered sequence {5;,, j=1,...,Nn, 
@= ly2yex | 


As Q=(QNG)+(QNG*), obviously, {C, NG} CG and 
{S,,} CG are covers of QNG and QNG*, respectively. Conse- 
quently, 


ata G) SOG) 


and 


pate fo) > w(QNG*), 


and then by (2.9b), 


yop (Cn) = w(QNG) + w(QNG®). 


Because this inequality holds for every cover C of Q, it should also 
hold for the limit inferior to yield 


wWQ) 2 WQNG) + wQNG*). (2.9) 


If $0 7(Cn) = co, then “equals symbol” in (2.9b) has to be 
replaced by “ > ” to yield (2.9c) again. The inverse inequality is due 
to (2.3). Therefore, G separates all subsets of (2 and, consequently, 
we have G C »™. 


(it) By Problem 2.2, for each G € G, there is a cover {C,,} of G 
such that G = 5>°..,C), and *(G) = S5°°.,7(C,,). Hence, if 7 is o- 


additive, p* coincides with y on G. 


These two facts prove that (*, 5) is an extension of (G,y). UO 
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2.10 Remarks. 


(i) One should bear in mind that, although (G, y) can serve as 
an extendible formatter for the outer measure pu*, G is not really a 
generator for »’*, as the latter need not be the smallest o-algebra 
containing G. We want to make a clear distinction between these two 
terms. Recall that a family G C P(Q)) is said to be a generator of 
another family (G C ) Fy C P(Q) with a property P, if Fo is the 
intersection of all supercollections of G on each one of which 
property P holds. In our case, %’* will eventually contain the 
smallest c-algebra 3’ = »'(G) and, in general, p.* needs to be further 
restricted to this o-algebra. From Theorem 2.9, we conclude that any 
elementary content y on a semi-ring G, which is o-additive, can be 
extended to a measure pp = Resyig)u" (acting on the smallest o- 
algebra »’ generated by G). In other words, if y is a o-additive 
elementary content on a semi-ring G, then there exists at least one 
extension, namely, Carathéodory's extension. 


(72) From the proof of Theorem 2.9, it is obvious that a semi- 
ring of rectangles, with a c-additive elementary content on this semi- 
ring, is one of the most economical systems suitable for the 
Carathéodory extension. However, it is often more preferable and 
common to start with premeasures on rings. In practice, one can first 
extend a semi-ring, with an elementary content, to the smallest ring 
with the content using the procedure proposed in Lemma 2.5 and 
Theorem 2.6, Chapter 4, and next Proposition 2.11. 


(277) Another reasonable question arises: in how many different 
ways can a formatter (G,y) be extended to a measure on »/(G)? 
Forthcoming Theorem 2.13 asserts that under some relatively minor 
restriction (Remark 2.12) made on set function 7, the uniqueness of 
Carathéodory's extension is guaranteed. O 


We begin with one useful extension of an elementary content on a 
semi-ring to a content on the smallest ring containing the semi-ring. 


2.11 Proposition. There is exactly one content on R(S), which 
coincides with the elementary content on S. O 
(See Problem 2.3.) 
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2.12 Remark. In Definition 1.1(vi2) we introduced the notion of 
o-finiteness of a set function. Sometimes it is more convenient to use 
another definition of o-finiteness, which is equivalent to Definition 
1.1(viz) for a large class of set functions. Namely, the condition to 
have a monotone increasing sequence {G,,}7 Q from G, with 
(Gi) < oo for all n, can be replaced with the equivalent condition 
that there is a countable partition {0),Q2,...} CG of Q(==U™, 
Q2,,) such that y(Q,,) < oo for all n. For instance, rings with contents 
clearly provide a basis for such equivalence. For a semi-ring with an 
elementary content, the first definition yields the second one, 
because we can arrange from {G,,}f 2 a countable decomposition; 
however, the converse, obviously, does not hold. 


Another related feature we use in the sequel is the o-finiteness of 
a particular set. Let (2, »’, 41) be a measure space. A measurable set 
A is said to be o-finite if Resy,,4p is o-finite. O 


2.13 Theorem. Let G be a | -stable generator of the o-algebra 
3(G) in Q such that G contains a monotone increasing sequence 
{By}? Q. Let py and jg be two measures on 33(G), which are o- 
finite on {B,,} and coincide on G. Then 1, = [tg on 37(G). 


Proof. Let A€G such that p1(A) = p2(A) < oo and let 
Dy={BeX: w(AN B) = p2(AN B)}. We show that Dy is a 
Dynkin system: 


a) A € Dy, implies that Q € D4. 


b) Let DE Dy. Then AN D° = A\D= A\(AND), which 
implies that 


fn(AN D°) = pi (A) — (AN D) 


= ji2(A) — (AN D) = p2(AN D®), 


and this leads to D® € Dy. 


c)Let {D,,} be a sequence of disjoint sets from D4. Then 


pi(A M De = m(S2A a Dy) = (A M Dn) 


n=1 n=1 n=1 
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- ¥ #(AN Da) = po(S2 AND) = po(AN Dy). 


n=1 n=1 


Hence )> D,, € D4, and therefore D4 is a Dynkin system. Obvious- 
n=1 

ly G C D4. Consequently, G C D(G) C Dy . Also because G is M - 

stable, it follows that D(G) is a c-algebra. Hence, we have 


G CDG) = X(G) CDs C XG) 
leading to 


In particular, we proved that VB € 37(G) w1(AN B) = p2(AN B). 


Now let {B,,} be a monotone increasing sequence of sets from G 
convergent to 2. Thus X'(G)=Dg,. Then Vn =1,2,..., and 
VB e€ XG), 


fiq( Ba NM B) = j2(Bn M iB), 


Because {B,, 71 B}7} B and because p4;(BM B,,) < oo, by Lemma 
1.6, 


jim, f41 (B a By) = jim. [lo(B a By) 


= jn(B) = p(B), VB € XG). U 


Now, by means of Theorem 2.13 we easily deduce the following 
significant statement. 


2.14 Corollary. Let y be a o-finite and o-additive elementary 
content on a semi-ring G. Then a Caratheodory extension of y to a 
measure on o-algebra 37(G) is unique. O 


The lemmas below are used for various purposes and, in parti- 
cular, lead to a relationship between the completion (Q, 2’, 72) of a 
measure space (Q2, 2’, j4) and the o-algebra * of all jz*-measurable 
sets. 
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2.15 Lemma. Let (0,G9,7) be an extendible formatter of the 
outer measure t* and G, the collection of all at most countable 


unions of elements from G. Then, for each Q CQ, there is a set 
G, €G,, such that G, D Q and 


U(Go) SwWQ) +e. (2.15) 


Proof. Because j.* is generated by (G, 7), we have 


uw(Q) = { intl dene Ga): {Gay = Gy}; . 7 . (2.15a) 


If pw*(Q) = co, then inequality (2.15) holds trivially. Suppose 
Lu*(Q) < co. Then, by definition of a limit inferior and from (2.15a), 
for every « > 0, there is {G,,} € @ such that 


w(Q) +e 


= yi (Ga) = yt _sWGn) 


k 
Ss eeu ( U Gn). (2.15b) 


n=1 


Now, we make use of the fact that (G, y) is an extendible formatter. 


k 
This implies that not only G C »*, but also G, C X*. Since U, Gp 


; = 
is monotone increasing and u( 5,Gn) < oo for all k, by conti- 
n= 

nuity from below (Lemma 1.6), 


U G,). 


n=1 


singe (Ue) =( 


Passing to the limit in (2.15b), which holds true for all k, we prove 
(2.15) with G, = U G, being a pledged set. O 
n=1 
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Lemma 2.16. Let {u* be an outer measure, 5* the o-algebra of 
all *-measurable sets, and A any subset of Q. If there is a u*-meas- 
urable set B such that B > Aand y*(B\A) = 0, then Ac &". 


Proof. Because B € %*, it should ju*-separate Q: 
WQ) = HQ B)+ wQN BY). (2.16) 
Now, because A C B, we can easily show that 
QN AS =QNB°+QN(B\A). (2.16a) 


From Q 1 (B\A) C B\A, it follows that u*(Q 9 (B\A)) = 0. From 
(2.16a), 
wWQN A) = w(QN B). 


Consequently, we can replace u*(QM B°) in (2.16) by w*(QN A°). 
Finally, noticing thatQ MBC QN A, we have that 


HQ) = HQ B)+wQN BY > wQN A+ wQN A, 


and this is the required inequality. O 


Lemma 2.17. Let y* be the outer measure generated by an 
extendible formatter (G,y), &’* be the o-algebra of all .*-measur- 
able sets, 1) be Resy-pu*, and let 3/(G) be the o-algebra generated 
by G. Then, for every A* © &* such that p5(A*) < ov, there is a set 
Be SG) with B D A* and y5(B\A*) = 0. 


Proof. Because j15(A*) < co, @ #@. From Lemma 2.15, for 
every € > 0, say 7, there isa Gk = J Gk D A* such that p5(G*) 


n=1 
< pi(A*) +e. The latter yields that pj(GS\A*) < 7. Obviously, 
(| G* is still a superset of A* and in as much as ()G* CG", it 
k=l k=1 
follows that 


ph(Dm) < 3(G™) < 4 (2.17) 
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where Dn = @ ct) \A* € X*. The sequence {D,,,} is clearly 
k=l 


monotone nonincreasing and /15(D,) < oo. Therefore, by continuity 
from above [see Theorem 1.7 (z)] of 5 and because of (2.17), 


n lim H9(Dm) = Hi (A ct) \a'} = 0 


CO 
The set () G* can serve as a set B asserted in the lemma, as it 


k=1 
obviously meets the requirements regarding set B in the statement 
and we are done with the proof. O 


Corollary 2.18. Let :* be the outer measure generated by a o- 
finite extendible formatter (G,y), 2’* be the o-algebra of all 1*- 
measurable sets, and let 37(G) be the o-algebra generated by G. 
Then, for every A* € &™*, there is a set BE X(G) with BD A* 
and j*(B\A*) = 0. 

Proof. (G,y) is o-finite. Thus there is a partition {H), Ho, 
...} CG of OD such that y(H;,) < oo. If A* € &*, then 


{At = A*N Hy, k =1,2,...} 
is a y*-measurable partition of A*, with y*(Aj) < oo for every k, 


and to each of which we can apply Lemma 2.17 and have a set 
B, € &(G), with B, D Aj and y*(B,\ Az) = 0. 


Notice that because 


(Um )\cam4y) = (Ua) n (Aas) 


n=1 


= Gia (Bewr)} 


{Bi N(AR)F, 


C8 


C 
k 


it holds true that 
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| (Um) Cami) 


<u |UBi\a] < Dew" B\As) = 0. 
The statement follows after letting B = J) B;, (€ 2(G)). O 


k=1 


Now, with the aid of the above propositions, we can finally 
answer the question about the relationship between the completion 
(Q, 5’, 72) of a measure space ((Q, 5’, jz) and the measure space 
(Q, ©, 445) (which is also complete, as we learned it). 


2.19. Theorem. Let (G,y) be an extendible formatter for 
(Q, &*, ug) and a generator for the measure space (Q, 7 = a(G), 
pt = Ress *) whose completion is (Q, 2’, 72). 


(i) Then, 3 Cd", 
(ii) If (G, 7) is o-finite, then ©} = X* and fi = 1}. 
Proof. 


(i) Obviously, ©’ C ¥* if and only if any element A of Y is of 
the form AUN, where AC XY, N is p-negligible, and A is p*- 
measurable. According to Lemma 2.16, AUN would be ,i*- 
measurable, if there is a j4*-measurable set B such that B > AUN 
and y*(B\(AU N)) = 0. By Definition 2.5(2) of a p-negligible set, 
N must have a 5’-measurable j-null superset, say No. [Note that 
even though, by Problem 2.10, u*(N) = 0 and p*(A UN) = p*(A), 
this does not warrant AU N € &*.] Because A U No is a superset of 
AUWN and, by Problem 2.11, (AU No)\(AUN) is a p*-null set, 
B= AUN, meets all prerequisites of Lemma 2.16, which makes 
AU N indeed ju*-measurable. This proves part (2) of the theorem. 


(ii) Because of part (i), we need to show that * C 3’, that is, 
each A* can be represented as the union of a ju-measurable set and ju- 
negligible set. By Problem 2.12, for any A* € »”*, there is a ¥’- 
measurable subset B of A* such that j.*(A*\B) = 0. Obviously, A* 
can be decomposed as B and ju*-measurable null set A* N B®. It only 
remains to show that A* M B° is ju-negligible. 
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By Corollary 2.18, for A*, there is a set C € »’ such that C D A* 
and u*(C'\A*) = 0. The set-difference C'\B = (C'\A*) + (A*\B), 
as the union of two j*-null sets, is a j:*-null set, therefore, a p-null 
set (as C\B € »’). This proves that A* M B° is ju-negligible. 

Now, we show that 7 = uj. (Recall that they are equal on »’.) 
Because 1’ = *, A* = AUN, where A € »' and N is p-negligi- 
ble, and 


p(A*) = p(A) = p*(A). (2.19) 


On the other hand, there is a p-null superset of N to yield 
u*(.N) = 0 due to monotonicity of ju*. Finally, from the inequalities 


p*(A*) < w*(A) + w*() = p*(A) 


pt(A* = AUN) > *(A), 


and 


it follows that p*(A*) = y*(A) and this, along with (2.19), yields 
that 7i(A*) = y*(A*) for each A* € S* = DY. O 


Example 2.20. If (Q, 2’, 1) is a probability space, it follows from 
Theorem 2.19 that the completion of (2’, 1) coincides with (&”*, 115) 
induced by (2’, jz) or by a “smaller generator” (G, y) of (X’,). UO 


A noteworthy question arises: if we have a semi-ring and o-addi- 
tive elementary content, would it make any difference if we first 
extend them to the smallest ring and premeasure, according to Pro- 
position 2.11, and then use the Carathéodory extension to arrive at 
the smallest c-algebra and a measure on it, or alternatively, apply the 
Carathéodory extension directly to that semi-ring and o-additive 
elementary content. The same question applies, say, to a ring with a 
premeasure and the generated o-algebra with a measure. The 
difference, if any, can apparently be manifested by the emergence of 
two outer measures, induced by a formatter and its extension. 


2.21 Theorem. Let (Q,G, 0) be an extendible formatter of outer 
measure jt” and a-algebra »* of *-measurable sets and let 
(E = E(G), 7) be an extension of (G, yo) and an extendible formatter 
of outer measure v* and B*, such that E C X&* and y = Resep*. 
Then, v* = p* on P(Q) and &* = B*. 
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Proof. Let Q C 12. Because @(G) C @Q(E), obviously 


v*(Q) < *(Q), (2.21) 


which yields the equation v* = 4* on a subcollection of sets 
Q € P(Q) with v*(Q) = oo. Suppose v*(Q) < oo. Then, for every 
5 > 0, there is a cover {E,} € €(E) with 


done (En) < v*(Q) + §. (2.21a) 


Because y = * on € and y(E,) = p*(E,) < 00, for each «2-1 
> 0, there is a cover {G',;,, k = 1,2,...} € G;(GQ), such that 


ope 10(Gnz) < w*(En) +e2-7°™ 
=(E,)-e2", (2.21b) 
Because {Gn,,7,k = 1,2,...} € G(G), from (2.21a) and (2.21b), 


HQ) S Viner 0(Gnz) SV*(Q) +545. (2.21c) 
Finally, taking in (2.21c) € = 4 leads to the inverse of inequality 
(2.21) and proves that * = v* on P(Q). The outer measure is the 
mere generator of the o-algebra of separating sets. Consequently, 
pu’ = v* yields that X’* = B*, which completes the proof. O 


An important consequence of Theorem 2.21 is the following. 


2.22 Corollary. Let (S,y0) be a semi-ring, o-additive and o- 
finite elementary content in Q, and let (E = E(S), y) be an extension 
of (S,yo) such that EC X(S). Suppose (2', = X7(S), 5) and 
(= V(E), fe) are the Caratheodory extensions with their 
respective outer measures ju; and > and o-algebras 7; and 3); of 
measurable sets. Then, the following hold true. 


Nv =5,=2.. 


2) 6 = Hs = Me On XY. 
3) w* = ps = we on P(Q). 
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4 == 
Proof. 


1) From € C 3’, we have ©, C D’,. From S C E C 2, it follows 
that 7, C de. 


2) Now measures ju, and pie act on the same o-algebra »’ and 
coincide on semi-ring S. In as much as yo is o-finite on S, by 
Corollary 2.14, ws = ple on Y’. 


3) With 
SCceCcs, C27. p= Ressy, —Resxp, 
and, consequently, 
“= Rese, = Rese/i7, 


we meet all conditions of Theorem 2.21 to have uw? = ut = p*. 
4) ©* = Li¥ = LF also by Theorem 2.21. C 


For instance, € can be a ring generated by S and 7, the extension 
of the elementary content yo in accordance with Proposition 2.11; or 
E can be an algebra with 7 as a premeasure or € can even be the o- 
algebra ¥’(S). In particular, it follows that, once the Carathéodory 
extension from (S,y) to (2(S),w) is rendered, another 
Carathéodory extension of (€,77) would be redundant. 


Another consequence of Theorem 2.21 is the uniqueness of outer 
measures generated by measures. 


Corollary 2.23. Let 1 be a measure on a o-algebra 3’, which 
produces the outer measure * with o-algebra %* of measurable 
sets. If there is another outer measure {i *, then fi * = u* on P(Q) 
and S* = 3 

Proof. This is a direct application of Theorem 2.21 with the 
following identification of the above characteristics: 


1) Let @ be a measure on »’ such that 77 = yw. Then 7 can serve 
as an extension of (2’, j). 
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iC e, 
3) v= = Resp’. O 


Remark 2.24. Corollary 2.23 is useful in various applications of 
Carathéodory's extension. Suppose 7; and y2 are two elementary 
contents coinciding on a o-finite semi-ring S (i.e. they are o-finite 
on S). By Corollary 2.14, their respective Carathéodory extensions 
fii and pa must coincide on X(S). Let pi and ps be the 
corresponding outer measures, according to Corollary 2.22, produced 
by y and 72 or js; and jig (regardless). By Corollary 2.23, uw} = 5 
on P(Q) and Xf = X75. O 


As in Theorem 2.21, by comparing two measures generated by a 
set function acting on a collection of sets and their extension, we 
ended up comparing two corresponding produced outer measures. It 
seems to be reasonable to raise another question: what if an outer 
measure will produce another outer measure? Would this make any 
difference? More specifically, can the restriction jj of an outer 
measure j* on »* become a formatter of another, different from p*, 
outer measure? Note that this is a different scenario from one 
considered in Theorem 2.21, because here ju* is not supposed to be 
generated by a formatter and it “acts on its own.” The following 
example shows us this distinction. 


2.25 Example. Consider P(Q), w*, &*, and yj in Example 2.4 
(2): 


0 =10,0,c}, A={a}, A°={bc}, P ={ob 


Q= ee R= {a, b}, S= {a, Ch 


p*(AS) = (BR) = w*(S) = 3, w*(P) = w*(Q) = 2, 
37* = {@,0,A, A}, and pp = Ressxp*. 


Then, generate the outer measure v* by (&™, /15). So, we have: 
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As we see, in most cases v* is strictly greater than w* on P(Q). OF 


According to Problem 2.1, if v* is an outer measure induced by 
i, then u* < v* always holds on P(Q). That u* = v* requires some 
restrictions, such as those in the proposition below. 


2.26 Proposition. Let p* be an outer measure on P(Q), 
Li) = Ress-uu* and v* be the outer measure produced by (2, 1). 
The equation * = v* holds true on P(Q) if and only if for every 
Q€eP(Q), there exists a set A* € X* such that A* >Q and 
p(A) = w"(Q). O 
(See Problem 2.17.) 


2.27 Remark. If j* is generated by an extendible formatter 
(G, 7), then clearly p* = v*, due to Theorem 2.21, as (uj, &*) can 
serve as an extension of (G,7). Alternatively, if Q € P(Q), 
according to Lemma 2.15, for each positive ¢, there is a set Gz € G, 
(a collection of all countable unions of elements from G) such that 
G, > Q and w*(G,) < u*(Q) +e. We assume that v* is the outer 
measure generated by j15. Because * = v* on P(Q) and G, € &”, 
we have *(G,) = v*(G,) and, by monotonicity, v*(G,) > v*(Q). 
Thus, we have 


U*(Q) S w(Go) = V"(Go) < w(Q) +, 
which yields v*(Q) < y*(Q). The inverse inequality is due to 


Problem 2.1. O 


2.28 Theorem. Let (9, 57,1) be a measure space such that 
3} = 31(S) with S being a semi-ring, and 1 be o-finite on S. Then, 
given A€ X'(S) and ¢ >0, there is a disjoint countable cover 
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{Si} CS of A that approximates” A, that is, such that 
A CG XP On and pl (57248n)\A) <2. 


Proof. Let 7 = Ress and j* be the outer measure produced by 
(S,y). Then, ju is the unique Carathéodory extension of y from S to 
37(S), according to Corollary 2.14, and pz = Resypu*. 


Case 1. Let w(A) = p*(A) < co. 
Then, by (2.8) (of Proposition 2.8), for each ¢ > 0, there is a 
sequence {G',} € & such that 


wenn V(Gn) = Vinn1(Gn) < WA) + €. 
Since u( U Gs) < SO" W(G,), we have that 
n=l 


»({ Be.)4) <e 


Case 2. j(A) is arbitrary. 

By o-finiteness of j1 on S, there is at most a countable decompo- 
sition \0P. Ox of Q by {Q;} CS such that p(Q,%) < co and hence 
[AM Oz) < co. Now, we apply the above arguments to AM Q;, and 
5c Hence, there is a sequence {G'n,} C SM Ox such that 


This leads to 
ad om U Gnk)\(A 7 2) <eé 


and thus to 


=i ( 5 6,)\4) <6; 


where G,2 =), i Gne: 
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Finally, it remains to form a disjoint sequence of semi-ring sets as 
stated in the theorem. The latter can be rendered in the same way as 
in Lemma 2.5 of Chapter 4. O 


2.29 Corollary. Jf under the condition of Theorem 2.28, 
[u(A) < co, then A can be approximated by just a finite tuple of 
disjoint semi-ring sets. 


Proof. Because (A) < oo, by Case 1 of Theorem 2.28, so is 
uur, G',) < oo. Then, by continuity from above, for each « > 0, 
there is an N such that 


MO nti Gr) <€, foralln > N 
thereby leading to 


(AAR Sk) < 2e. o 


PROBLEMS 


2.1 Let ju* be an outer measure on P(Q2), uj = Ress+pu*, and v* 
be the outer measure induced by uj. Show that p* < v* on 
P(Q). 


2.2 Let (G,y) be a formatter of the outer measure ju* defined by 
(2.8). Show that if 7 is an elementary content and G is a semi- 
ring, then for each G € G, there is a cover {C,,} of G such that 


G= yenniCn and L(G) a pig (6-9) 


2.3. Prove Proposition 2.11. There is exactly one content on R(S), 
which coincides with the elementary content on S 


2.4 Let yw be a finite measure on (Q, ©’) and let 3’ be any sub- 


collection of X’. Show that, for any fixed subset Q CQ, it is 
true that 


inf{p(A): Q © A} = w(Q) — sup{p(A*): Q C A}. 
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2.6 


Ze 


2.8 


2.9 


2.10 


2.11 


2.12 


2.13 
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Show that the original definition of o-finiteness 1.1(vi2) 
implies the second definition of o-finiteness for semi-rings and 
elementary contents mentioned in Remark 2.12. 


Let y* be an outer measure on P(Q) and {A,,} a sequence of 
disjoint j:*-measurable sets. Show that for any Q C Q, 


u*(Q M ein) = ye (2 q An). 


Let N € N,, (ie., a p-null set) and let B € &’. Show that 
w(N UB) = p(B\N) = p(B). 


Show that X’ defined in Definition 2.5(i7) is a o-algebra, 77 is a 
measure, that this extension does not depend upon 
representations of sets of X’, and that (Q, X’, 72) is complete. 


Show that the measure space defined in Example 1.2(iv) is 
complete. 


Let y* be an outer measure on P(2Q) and N C 2) be such that 
p*(N) = 0. Show that for any subset Q CQ, w*(QUN)= 


u*(Q). 


Show that (A U No)\(A UN) in part (i) of Theorem 2.19 is a 
j*-null set. 


Let p* be the outer measure generated by an extendible o-finite 
formatter (G,y), &’* be the o-algebra of all ju*-measurable 
sets, and let 3’(G) be the o-algebra generated by G. Show that 
for every A* € &*, there isa set B € X'(G) with B C A* and 
u*(A*\B) = 0. 


Let (Q, Xo, 0) be a completion of a measure space (Q, %’, j1). 
Define for each A C 0 f(A) = sup{u(B): Be ©, BC A} 
and y(A) = inf{(B): Be », A C B}. Show that if 
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2.14 


2.15 


2.16 


2.17 


2.18 


a) A€ Sp, then f(A) = (A) = pol), 
b) f(A) = u(A) < 00, then A € Xp. 


Let »’ be a o-algebra in (2 and let a € 22. Show that for 
{a} € 3’, the measure space (2, 3’, €,) is complete if and only 
if = Pi). 


(Generalization of Problem 1.12.) Let (Q, 2’, 2) be a measure 
space, {4 be o-finite, G.. = {G € Y: u(G) < oo}, and YQ = 
{Q CO:QNGELX,VG € Gx}. Show that Y = XY. 


Let (0, 27,1) be a measure space, G. = {Ge X:u(G) 
< oo}, and Yo ={Q CN:QNG € Y,VG € Gx}. Define 
the set function j4., on 7, as 


_ J oQ), QE 


Problem 1.13 claimed that j1., was a measure on 3. Show 
that if the measure space (Q, ’, jz) is complete, then so is [.0. 


Prove Proposition 2.26. Let yu* be an outer measure on P(Q), 
[iy = Ress»p*, and v* be the outer measure produced by 
(X)*, uj). The equation y* = v* holds true on P(Q) if and 
only if for every Q € P(Q), there exists a set A* € X* such 
that A* > Q and y*(A) = p*(Q). 


Let p~* be the outer measure generated by an extendible 
formatter (G,y) on a non-empty set 2, »”* be the o-algebra of 
all j.*-measurable sets, and 4’(G) be the o-algebra generated 
by G. Show that a subset N C 1) is negligible if and only if 
uN) = 0. 
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3. LEBESGUE AND LEBESGUE - STIELTJES 
MEASURES 


In this section, we utilize the results of the previous section for the 
construction of Lebesgue and Lebesgue - Stieltjes measures. We have 
learned that to warrant the Carathéodory extension, a given formatter 
should be at least a semi-ring and o-additive elementary content, 
which applies to some important special cases of formatters in 
Euclidean spaces. In Theorem 3.1 below, we show that the Lebesgue 
content is o-additive on the ring R(R”), which implies that the 
Lebesgue elementary content is also c-additive on the semi-ring of 
half-open intervals. Although it is possible to prove this statement 
directly (with no prior extension and @-continuity arguments, as in 
Theorem 3.1), in order to exploit the equivalence of O-continuity and 
o-additivity, we prefer to start with the extension of the Lebesgue 
content on the ring R(R”) to the Lebesgue premeasure. 


Theorem 3.1. The Lebesgue content X, on the ring R(IR”) is o- 
additive, that is, a premeasure. 


Proof. Because the Lebesgue content \, is finite on R, by Propo- 
sition 1.7 (iz), A. would be a premeasure if it were O-continuous. We 
use the equivalent version of @-continuity: 


For every monotone decreasing sequence {A;,}| CR with 
Ac(A1) < co, the assumption that Jim Ac (Ax) (which clear- 
—00 


oe) 
ly exists) is strictly positive must yield that (| Ay, # ©. 
k=1 


Let {.A;,} be any such monotone decreasing sequence with 
e= jim Ac(Ax) > 0. (3.1) 


It is readily seen that (3.1) implies that for eaeny k, Ap #@, and 
therefore, by Cantor's Theorem 5.4, Chapter 2, n A, # ©. How- 


ever, the nonempty intersection of the dita of A,'s does not 
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[oe 
imply that the intersection () A; 4 @ either. To overcome this pre- 
k=1 
dicament we construct a subsequence of compact subsets of A;'s 
whose intersection is nonempty. 


Now, in as much as A;'s € R, each A; can be represented as a 
finite union of disjoint half-open rectangles, say :’_, P, (for brevity 
let us drop index k) such that A.(P;) > 0. Then for each value of € 
and for every P;, there is a half-open rectangle 7, whose closure 
IT, is a proper subset of P, and such that 


Ae(Ps) < Acs) + =e (3.1a) 
Bound (3.1a) yields that 
AMAR) < Ac(Br) + 5E (3.1b) 


where B;, = >~)_,/I,. Obviously, B, C Ax. It seems as if we are 
done with the sequence {B;,}. However, the claim that () B, 4 @ 
k=1 


is unwarranted, since {B;,} need not be monotone decreasing. There- 
fore, we introduce 


k 
CO. =/( 12a 
jel 


which forms a monotone nonincreasing sequence of sets termwise 
dominated by { A; }. Now, we need to show that C;, 4 @. In fact, we 
will be able to prove a much stronger statement that A.(C;,) > 0 for 
all k. Namely, we prove that 


A(Cr) 2 A(Ag) _ e(1 _ ar) (3.1c) 
which, because of .(A;,) > €, would yield the needed inequality: 
AACE) = ear. (31d) 


We prove (3.1c) by induction. For k = 1, (3.1c) holds true, because 
from (3.1b), 


Ac(C1) = Ac(Bi) = Ac(A1) — se. 
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Now we assume that (3.1c) holds for some k > 1 and show the 
validity of (3.1c) for k+1. Because of Chi, = Byiy AC, and 
Proposition 1.5(72), 


Ac(Brsi U Ce) = Ac( Brat) + Ac(Ce) — Ac(Crsi). (3.1) 


Due to (3.1e), the inequality \,(Bi41) > Ac(Anii) — sere [by (3.1b) 
for k + 1], and the assumption about the validity of (3.1c) for some 
k, we have: 


Ac( Bri U Cr) = Ac(An+) 
+ Ac(Ag) — Ac(Ck41) — €(1 — sez). 
Since obviously By; UC, C Az, we have 
Ac(Ag) = Ac(Brsi U Cr), 
and hence 
Ne(Cr+1) > Ac(Ar+1) — €(1 — se) 
+ A(Ax) — Ac(Brai U Cr) 
> Ac(Ax+1) — (1 — gaz). 
This proves (3.1c) and (3.1d) and thereby yields that {C;,} is a 


monotone nonincreasing sequence of nonempty compact sets; hence, 
by Cantor's Theorem 5.4, Chapter 2, 


Ar 2: #9. 


k=1 k=1 


In conclusion, A, is indeed a premeasure on the ring R. O 
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3.2 Remarks and Definitions. 


(2) Theorem 3.1 states that the Lebesgue content on R(S) in 
R” is o-additive. This, obviously, implies that the Lebesgue elemen- 
tary content is also a-additive on S. 


(it) In Example 1.2(iz) we defined the Lebesgue elementary 
content ° on the semi-ring S of half-open intervals in R”. Now, by 
the use of Proposition 2.11, Corollary 2.14, and Theorem 3.1, we can 
have the couple (R, A.) or, in light of Remark (2), even (S, \°) as an 
extendible formatter of the outer measure X* acting on P(R"). We 
call this set function the Lebesgue outer measure. The o-algebra 
d/* C P(R”) of all A*-measurable sets, in the specifically designated 
notation £L*, is called the Lebesgue o-algebra of measurable sets. 
The corresponding restriction of A* to £*, Aj = Res;-A*, is called 
the Lebesgue measure. The Lebesgue measure space (R", L*, Xj) 
will form a complete measure space, as per Proposition 2.7. The fur- 
ther restriction A = Resyis)A* = Resgipn)A* of the Lebesgue outer 
measure to the smallest o-algebra generated by S (which, according 
to Theorem 2.8, Chapter 4, is identical to the smallest o-algebra 
generated by the usual topology) or, equivalently, by #, known as 
the Borel o-algebra 6 on R”, is referred to as the Borel - Lebesgue 
measure. The reader should be aware of the weighty differences 
between these two measures and measure spaces, often confused 
with one another in the literature on measure theory. 


By noticing that there exists a monotone increasing sequence 
(—k,k]"} R” of half-open squares with 


Pac _ kl) — Qrpn < oo 


we conclude that A° is o-finite on S and, therefore, by Corollary 
2.14, the Borel- Lebesgue measure » is unique on B. By Remark 
2.24, the Lebesgue outer measure A* and hence the Lebesgue mea- 
sure Aj are also unique on P(IR”) and L*, respectively. Finally, by 
Theorem 2.19 (zz), the completion of Borel- Lebesgue measure 
coincides with Lebesgue measure Aj on £* and the corresponding 
completion of the Borel o-algebra coincides with the o-algebra L* of 
Lebesgue measurable sets. 
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Both Lebesgue and Borel-Lebesgue measures have their 
strengths and weaknesses. The Borel - Lebesgue measure operates on 
the Borel o-algebra, which stems from the usual topology and pre- 
serves reasonable topological properties. Furthermore, the Borel - 
Lebesgue measure is an element of a very important class of Borel 
measures. However, unlike the Lebesgue measure, the Borel - 
Lebesgue measure is not complete. O 


3.3 Definitions. 


(i) Let B = Xr) bea Borel o-algebra in 2. Any measure ju on 
B is called a Borel measure and the triple (Q, B, j1) is called a Borel 
measure space. 


(ii) Let (Q,d) be a metric space and B be the Borel o-algebra 
induced by the metric topology. A Borel measure pz on BG is said to be 
a Borel - Stieltjes measure if ~(B) < oo for any d-bounded Borel set 
B. (More on Borel - Stieltjes measures in Problem 3.17.) 


A Borel measure p on (R", B) is said to be a Borel - Lebesgue - 
Stieltjes measure if p(B) < oo for any d--bounded Borel set B. 
Clearly, any Borel - Lebesgue - Stieltjes measure is o-finite. In other 
words, a Borel- Lebesgue - Stieltjes measure is a Borel - Stieltjes 
measure on G that is generated by the Euclidean (usual) metric topol- 
ogy. 

(277) Let ~ be a Borel - Lebesgue - Stieltjes measure on (IR”, B). 
Now, in light of Carathéodory's construction we can use the couple 
(B, 1) as an extendible formatter of the outer measure j* operating 
on P(R”) and call this set function the Borel outer measure. The o- 
algebra ¥’/* C P(IR") of all ju*-measurable sets is denoted by Bi, and 
called the Lebesgue - Stieltjes o-algebra of measurable sets. The 
corresponding restriction jj = Resg:p" is called the Lebesgue - 


Stieltjes measure. In the literature on measure theory, the Lebesgue - 
Stieltjes measure is often confused with the Borel - Lebesgue - 
Stieltjes measure. O 


In addition to the Borel - Lebesgue measure on Borel o-algebra 6 
on R”, we present another construction of a Borel - Lebesgue - 
Stieltjes measure, for simplicity letting dimension n = 1. In Example 
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1.2(zi7), we introduced the Lebesgue - Stieltjes elementary content 
Ly on the semi-ring S of all half-open intervals (a, 6] C R, by means 


of an extended distribution function (i.e., a monotone nondecreasing, 
right-continuous function) f: (IR,d.) — (R,d.), as py ((a, bj) 


= f(b) — f(a). Obviously, Ly reduces to the Lebesgue elementary 
content if f(a) = «. According to Proposition 2.11, Oy can uniquely 
be extended to the Lebesgue - Stieltjes content 1+ on the ring R(S) of 
“figures.” The following is to show that ju is o-additive. 


3.4 Theorem. Let ji; be a Lebesgue - Stieltjes content on the ring 
R(S) induced by a monotone nondecreasing right-continuous 
function f. Then 1s is a premeasure. 


Proof. Because py is finite on R(S), as in Theorem 3.1, it is 
sufficient to show that ju; is @-continuous. Let {F,,} be a sequence 
of sets from #(S) monotonically decreasing to O. We prove that 
jim fp(Rn) = 0. 


We assume that R, CC, n=1,2,..., where C’ is a compact 
set in (R,7.). A set R, € R is a figure if it is a finite union of 
disjoint intervals of type (a, b]. Because of right-continuity of f, it 
can be easily shown that, for each fixed ¢ > 0 and for any figure R,,, 
there is a subfigure B, C R, such that B, CR, and such that 


Ut(Rn) — bp(Bn) <€27". It also follows that () B, =@. We 


n=1 


claim that there is an r such that (] B;, =@. To see this, observe 
k=1 

that {C’\.B, =C 1 (B,)*; n= 1, 2,...} is an open cover of C in 

the relative topology (C, tC). Because compactness is weakly 

hereditary and C’ is closed, it follows that C’ is also compact in 

TC. Thus, the above cover reduces to a finite subcover, say 

C\B,,..., C\B,, yielding that 


ir 


C=U(C\B) = C\( 1) Ba). 


k=1 


Thus, 
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Now, for all n > r, 


and 
fis(Ra) = Hy(Ba\ 7) Bu) + 447( 1) Br) = Hy Ra\ Br): 


In as much as { R,,} is monotone decreasing, it follows that 
Rn\( 1) Be) = U(Ra\Br) S U Re\ Br). 
k=1 k=1 k=1 


Observe that this is the desired inclusion implying the estimate 
ur(R,) <e¢. This inclusion is due to the inclusion R,,\B, C 
R,\By, which holds for all k < n (as long as n < 00). Hence, the 
above countable intersection reduces to a finite intersection of the 
sets B,, k=1,...,r. Thus we have 


n 


[f(Rn) = wp(Rn\ (1 Be) 


nm 


< s4p(U Re\ By) < Do yy(Re\By) < (1-27), 


which implies that ju;(R,) — 0. O 


It can alternatively be shown that the Lebesgue - Stieltjes 
elementary content is o-additive with no prior extension to the 
Lebesgue - Stieltjes content and bypassing @-continuity. 


3.5 Remarks. 

(2) Using the same arguments as in Remarks and Definitions 3.2, 
we will extend the Lebesgue - Stieltjes elementary content Ly (or 
content ji) from the semi-ring S (or ring R(S), respectively) to the 
Lebesgue - Stieltjes measure 1; on the o-algebra &* (= Bi) of 
Lebesgue - Stieltjes measurable sets and then reduce it to the unique 
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measure j4f, which is clearly a Borel - Lebesgue - Stieltjes measure 
on the Borel o-algebra B(R). 


(22) When dealing with Borel measures, it is common to observe 
a certain property of a o-finite Borel measure /1; on the semi-ring S 
in IR” and extend this property of ju; from S to the Borel o-algebra 6 
arriving at another Borel measure ju. Because 1; and juz coincide on 
S, by Corollary 2.14, j4) = pg on B. Consequently, by Remark 2.24, 
the corresponding outer measures j1} and ju5 must coincide on P(R") 
as well as their restrictions on By = B. Note, however, that 67 is 


not a general notation as B is, for it is not induced by the usual 
topology and it is related to a particular Borel measure ju on B. 


(272) We have learned that if f is an extended distribution 
function [see the definition in Example 1.2(227)], then it induces a 
Borel - Lebesgue - Stieltjes measure on 6. Conversely, a Borel - 
Lebesgue - Stieltjes measure jz generates an extended distribution 
function. 


If 4 is a finite Borel - Lebesgue - Stieltjes measure on B, then we 

can set f(x) = u((—co, z]) and such an f is a distribution function. 
Indeed, take a sequence 4, > @) >--- — a. Then f(z,) — f(x) 
= 1((x, £,]) — 0, by O-continuity of 4 [Theorem 1.7 (2)], which 
shows that f is right-continuous. Since ju is a finite measure, f is 
bounded. Finally, if x,, is any monotone decreasing sequence conver- 
gent to —oo (such as { — n}), then, again by @-continuity of ju, it 
follows that p((— oo, x, ]) and thus, f(z) — 0. 


If ys is an arbitrary Borel - Lebesgue - Stieltjes measure, we can 
define f(0) = 0 and 


_{ W@,2), © >0 
a — p((x,0)), <0. a 


Similarly, one can show that f is an extended distribution function. 
(See Problem 3.3.) If & = S(R,B) denotes the set of all Borel - 
Lebesgue - Stieltjes measures on (IR, B), then it can be shown that 
any two extended distribution functions f; and f2 that induce pp € B 
can differ only in an additive constant (see Problem 3.4). The latter 
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generates an equivalence relation among all extended distribution 
functions, say E. So, the distribution functions that differ only in an 
additive constant form a class associated with one such measure . 
Therefore, if ©, denotes the set of all extended distribution 
functions, for each j1 € %, there is a unique equivalence class { f; ju} 
of all extended distribution functions that induce ju, such that 
{fru} ={f +e:c € R}. Let D.|E: = {{f; u}: © B} be the cor- 
responding quotient set of D.. Consequently, there is a bijective map 
D from set B onto set D,|E. 


As regards the subset 8, of all finite Borel - Lebesgue - Stieltjes 
measures, then, obviously, each one of them generates a unique 
distribution function and there is a bijective map between %8,, and the 
set D ( C D-) of all distribution functions. 


To make all distinctions between distribution and extended 
distribution functions lucid the reader may find it plausible to go 
over Problem 3.9. O 


We return to the Lebesgue measure \j on L*. First, we prove a 
lemma about negligible sets. One of the interesting consequences of 
this result is that in IR”, all Borel sets having a dimension less than n 
are null sets. 


3.6 Lemma. 4 set N C R” is \-negligible if and only if for each 
€ >0 there is a countable cover {I,} CS of N by semi-open 
intervals I;,'s such that \yy°_,A°(Ik) < €. 


Proof. Let N be \-negligible. Then, by Problem 2.18, A*(V) = 
0 and 


0 = A(N) = inf{(0 Ae): {et} © Ht, 


where @j is the set of all countable covers of N by semi-open 
intervals and it is nonempty, because otherwise A*(V) would equal 
oo. By the definition of a limit inferior, for each ¢ > 0, there is a 
cover {Ij,} € G@ such that 


M(N) +6 > VPA Tk), 
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which proves the first part of the statement. 
Conversely, let ¢ > 0 and let {J,} C S be a countable cover of 
N with the property that )7°°_, \°(I;,) < e. Then, 


M(N) < (Ui) < DEA) <= 


and hence, by Problem 2.18, N is a A-negligible set. O 


3.7 Lemma. Let f:.R — R be an additive function, continuous 
at zero. Then, f is linear. 


Proof. First note that 


FO) + FO) = FO + 0) = f(0). (3.7a) 


This yields that f(0) = 0. Then, from 


0= fO)= fw@—2)=f@+ f(-2) (3.7b) 


it follows that f(x) = — f(— <x) and thus f is odd. Now, let n be 
any positive integer number. Then, in as much as f is additive, 


f(nx) = nf (a). (3.7¢) 
If n is a negative integer, then, from (3.7b -c), 
f(nz) = f((— n\(— 1)a) 
= —f(—nar)= —(—n)f(@) = nf). 


Hence, for each n € Z, 


J(ne)=ni(o), (3.7d) 
which yields that 


f@) = 1/@). 3.7 
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Combining (3.7d) and (3.7f) we have that for each integer m, 
mf) = fG2) = TF). 
In other words, for each rational number gq, 
f(qx) = qf (@). 


Because f is continuous at zero and because f is additive and odd 
we have from 


fa@-w=f@+h-wy=fwo- fy 


that f is continuous on R. Now, let r € R. Then, there is a sequence 
{dn} of rationals convergent to r. Due to continuity of f, 


tim f(4n) = £0). 3.78) 


On the other hand, f(q, - 1) = dn f(1) and (3.7g) lead to 
f(r) = lim fn) = fA) Tim an = f(r. 


This shows that f is a linear function f(x) = cx, where c = f(1). O 


3.8 Corollary. Let f:R” — R be continuous at zero and 
additive for each variable separately. Then 


Fie ay hie) = CH a, were 6 = FUL .a54. 1h 


Proof. If x2,...,Z, are fixed, then by Lemma 3.7, 


PGi) = Wy ce 


Applying the same procedure successively to the other variables we 
have the statement. O 


3.9 Definition. Let L : R” — R” be a bijective function such 
that L(y) = y+x, for a fixed x € R". A Borel measure jz on 
BIR") is said to be translation-invariant, if for each Borel set 
B € BR") and x € R", w(L,(B)) = p(B), where L,(B) = B+ a 
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={r+y:y € B}. Now, L,.(B) = B+ x is Borel, because L is an 
affine (that is, continuous) transformation. O 


We show in Section 4 that the Borel - Lebesgue and Lebesgue 
measures are translation-invariant. The following theorem states that 
any translation-invariant Borel measure is a multiple of the Borel - 
Lebesgue measure. 


3.10 Theorem. Let «i be a translation-invariant Borel measure 
on B(IR"). Then, 4 = cr, where X is the Borel- Lebesgue measure 
on B(IR”) and c = WC) (C stands for a unit cube). 


Proof. For each x € R, define 


[a Oy... 20 
i= QO, r= 
[0,z), «>0 
and 
ae ky ae) 
ee) eh ge OF 
Denote 


f(@1,-.-;y) = sgn (TI) (Tl). (3.10) 


We show that f defined in (3.10) is additive and continuous in each 
variable separately. Without loss of generality, we show it with 
respect to: 21. Leta; = 2 4+ y, 


Case 1. Suppose x > 0 and y > 0. Then, 
Try _ [0, x +y) _ [O, x) + eae 2 “a y) 


and 
ee = AR, + Ro, 
i=l 

where 
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i = de & Ip & oe ® Le, 

and 
Rg = |v, 0 + Y) * digg X00 ¥ Le 


In as much as x,y, and x + y are all positive, 


sen( Tle) = sen(x - £2 --- Lp) 


i=1 
= sen(y - £2°-- Ln). (3.10a) 
From (3.10a), 
FM er a) PC a ee 
= sgn(x - £2 --- Lp) uC) = sgnly - £2 --- Fn) uCR2) 
and because yu is translation-invariant, 
PBs 085 Bp) = FCs cecy ag Pay cts a): 
Case 2. Suppose x + y > Oand x > 0, y < 0. Then, 
sgn((x + y) - £2 --- Xn) 
= sen(% - XQ ---Lp) = — sen(y - Lo---Ln). (3.10b) 


Because 
Tp+y = [O, x ar y) = [0, x«)\[x or Y; x), 


we have 
A([z + y, x) = A([y, 0)), 


and because yu is translation-invariant, using (3.10b) we have that 
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FCP YI ng Ep) 
= sen((% + y) - £2--- Ln) 
fille K Tg, Moo Ty) = iy 1p, MO Bde) 
= Seria ages, ite ® dy Le) 
+ sgn(y - %2--- Up )pLy x Lng X +++ X Lr,,) 


= JE Piero s Uy asset a): 


Case 3.2 +y >Oand ax < 0, y > 0 are the same as in case 2. 


The other combinations of x and y are left for the reader. (See 
Problem 3.23.) 


Now, we prove continuity of f at zero. Let {a,} be a sequence 
convergent to zero from the right. Then, {a,} CR, and the 


sequence of sets {J,,,} is such that (] J, = {0}. The latter yields 
k=1 


that 
(Vlg & dey Se Ty pS a Oe ee, 
k=1 


By the definition, J) = @; and by continuity from above of ju, we 
have that 


jim J(Gh. a, coogi) =U, 
Similarly, by continuity from below of ju, we have that 
lim f(a, £2, owe Zn) =0 
k — co 


for {a,}7 0. In addition, f(0,22,...,2) = 0 holding true by the 
definition of f. 
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By Corollary 3.8, 


Fite eT ln Ee 


= _ 1) 21+ -tpp(C), (3.10c) 


where C' = [0, 1) x --- x [0, 1). On the other hand, 


Giese) = sgn (II) Le (T1z:,) ’ 


which, along with (3.10c), gives 


u (Ite, =e UO (3.10d) 


Note that 
in 
 (T IL. = a eR aay (3.10e) 


Equations (3.10d) and (3.10e) tell us that for any rectangle R whose 
all sides lie on corresponding coordinate axes, 


(R) = u(C)A"(R). (3.10f) 


For an arbitrarily positioned rectangle R all of whose sides are 
parallel to the corresponding coordinate axes, (3.10f) still holds true 
due to the translation-invariance of ju. 

By [9 = u(C)A° we define an elementary content on the semi- 
ring S of half-open rectangles. Then, by 77 = ~(C)A we also have a 
Borel measure on B. Now, we have three Borel measures on 6: 
Zi ,p, and the (unique, as S is M-stable) extension of jo from S to 
B. All three coincide on S and therefore must be equal on B. O 


3.11 Example. (Cantor ternary set). Consider the following 
family of subsets of [0, 1]. Let 
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Q= Co = (0, i}, Gi = (a4 2), Cy = Co\Gi, 


Go = ( ’ eee 3), Cy = C\Go, 


ole 


as depicted in Figure 3.1 below: 


0 G, l 
| 
1 G, = O7.(1) 
3 3 
0 F 1 2 F, 1 
EEE? 3 41 
i: OCs i O,(2) 8 
9 9 
F(2) F,(2) F,(2) F,(2) 
=——a =——_a 
Figure 3.1 


Therefore, each C,, is the union of 2” closed intervals, and each G', 
is the union of 2”~' open intervals. Also, 


gntl 


Qn 
(eare = Cr\Gn41 = Cay U Or (n + 1) = U Fy, (n) 
k=1 k=1 


and {C;,} is a monotone decreasing sequence of sets. The Cantor set 
is defined as 


[o.6) 
C= TG, 
n=1 
and it can be characterized as follows. 


1) C is closed as the intersection of closed sets. 
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2) Each C,, contains 2” closed disjoint intervals F\(n),..., 
F.(n). Each of these intervals is a term of the monotone decreasing 
sequence {F,(n)}| with d.(F,(n)) = A(Fi.(n)) | 0, n — co. By 
applying Cantor's Theorem 5.4, Chapter 2, we conclude that V 


lo) 
k =1,2,..., (| Fi (n) consists of exactly one point. In other words, 
n=1 


C is a union of isolated points and therefore nowhere dense. 
3) Ge = Co\ Ge = (U Gr) 
k=1 k=1 


He=C |e? — iG = > 
n=1 


n=1 n=1 k=1 


5) The Lebesgue measure of G,, is \(G,) = 5(4)", because 
MGi) =}, MGs) = 2k, MGs) =48 = HQ)... 


MG) = CRaMGn) = § Deo(G)" = 1 
Hence \(C’) = 1 — 1 = O and therefore C’ is a Borel \-null set. 


7) C is not empty, because C' contains all boundary points of the 


p a oe oe ee ee 
sets Cy2 0) Ly aa gy Grier aes Grete % 


These boundary points have the following ternary representations: 
0 = 0.0000... 
1=1.0 (or) = 0.22222... (in dyadic representation) 


+ =0.1 (or) = 0.02222... 
= 020122 2: 

* = 0.01 = 0.00222... 

7 =U,02 = 001224. 

£ = 0,21 = 0.2022... 
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Every set C;, has exactly 2” boundary points, each one of which has 
a unique triadic representation consisting of all n-tuples of digits 0 or 
2. Observe that 2” = |P(A)| where A is an n-element set. Therefore, 
C is equivalent to the set of all subsets of natural numbers which, by 
Remark 6.5 (7), has the cardinality of the continuum. In other words, 
C= R. Therefore, the Cantor set is an example of a uncountable 
Borel A-null set. O 


The following regularity notions are used in several problems 
below and will be revisited in [Dsh2] for more general measures on 
topological spaces (with a whole chapter dedicated to this theme). 


3.12 Definitions. Let ; be a Borel measure on a Borel measur- 
able space (92, B(7)) such that 7 is a Hausdorff topology on (. 


(i) A Borel set A € B = B(r) is called 
(a) p-outer regular if (A) = inf{u(O) : A C O € TH, 


(b) p-inner regular if (A) = sup{u(K): AD K € k} 
where K(Q) is the family of all compact subsets of Q, 


(c) p-closed regular if (A) = sup{u(F'): AD F € T°}. 


ii) A Borel measure pu is outer (inner or closed) regular on 
§ 
G C B(Q) if all sets of G are j1-outer (j1-inner or j1-closed) 
regular. 


(iii) jcis Radon if ju is 
(a) finite on K(Q); 
(b) outer regular on B(Q); 


(c) inner regular on T. 


(iv) is regular (weakly regular) if it is outer and inner (closed) 
regular on B(Q). 


(v) is strongly regular if it is regular and finite on K(Q). O 
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PROBLEMS 


3.1 


3.2 
3.3 


3.4 


3.5 


3.6 


3.7 


3.8 


Let H = {x = (a1,...,2n) € R": x; =a € R} be a hyper- 
plane orthogonal to the ith coordinate axis. Show that H is a 
A-null Borel set. [Hint: 1) Show that H is closed in (R”, 7.) 
and hence Borel. 2) Find a relevant countable cover of H by 
rectangles from S and apply Lemma 3.6.] 


Show that each countable subset of IR” is a Borel \”-null set. 


Show that f defined by (3.5) in Remark 3.5(777) is an extended 
distribution function. 


Let f; and f2 be two extended distribution functions and let pu; 
and jl2 be the corresponding Borel - Lebesgue - Stieltjes meas- 
ures induced by these functions. Show that 1 = jl if and only 
if f: — fo =, where c is a constant function. 


Let 9, be the set of all extended distribution functions. Show 
that D, is a semivector space over R, 


Let f; and fo be two extended distribution functions. If 4, and 
jig are the corresponding Borel - Lebesgue - Stieltjes measures 
induced by f; and f2, show that for any nonnegative scalars 
a, and ag, D(aypi + a2p2) = {ai fi + a2 fo; 4}, where D 
is defined in Remark 3.5(i22) and w = aypy + AofL2. 


Let f: R — R be an extended distribution function and let ju; 
be the corresponding Borel - Lebesgue - Stieltjes measure on 
B(R). Show that 


a) ((a,b)) = f(b—) — fla). 
b) wp([a, b]) = f(b) — fla —). 
c) ws (la, b)) = f(o—) — f(a—). 


d) f is continuous if and only if w,({x}) =0, c ER. 


Let f be the extended distribution function on R given by 
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3.9 


3.10 


3.11 


3.12 
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— 1, oe 2 

1+a, —2<2<0 

F(z) 24 0<2<3 
ee oe 


and let py be the corresponding Borel - Lebesgue - Stieltjes 
measure. Evaluate the measure of the following sets. 


a) {3} b){-2} c)[-1,3) d) [0,5)U (3,5). 


Let f be a distribution function and let jf denote the Borel - 
Lebesgue - Stieltjes measure induced by /. Justify with a proof 
or give a counterargument: 


a) Must f be an extended distribution function? 


b) Suppose g is a function defined by (3.5) of Remark 3.5 
(277). Is g a distribution function? If your answer is yes, is 
g= J! 
Let ys be an atomic measure ( = 37° )ai€0,). 


1) Is always a Borel - Lebesgue - Stieltjes measure? If it is 
not, give a condition under which ju is a Borel - Lebesgue 
- Stieltjes measure. 


2) In this case, give { f; ju}. 
3) Plot one such f. 


Consider the Borel o-algebra 6 = B(IR") generated by the 
usual topology. Show that, for any Borel set B € B and any 
pointe ER", B+exr={zeER z=ytu ye BleB. 


Let (Q, B, 4) be a Borel measure space, such that the Borel o- 
algebra B is generated by a Hausdorff topological space 7, and 
jt is a finite Borel measure on B. For any subset Q C 2, denote 
by K(Q) the collection of all compact subsets of Q. Show that 
the family M C B ofall inner regular sets B € B, i.e., with 


w(B) = sup{u(): K € K(B)}, 
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3.13 


3.14 


3.15 


3.16 


3.17 


3.18 


3.19 


3.20 


forms a monotone system in (2. 


Let (IR, B(R"), 14) be a finite Borel measure space (i.e., ju is 
finite). Show that ju: is weakly regular, that is, it is outer and 
closed regular on B(R"). 


Let (IR, B(R"), 4) be a finite Borel measure space. Show that 
ju is strongly regular. 


Suppose that (IR”, B(IR”), 41) is a o-finite Borel measure space. 
Show , is inner regular. 


Let (R”, B(R"), 4) be a finite Borel measure space. Use just 
the inner regularity of jz to prove that yu is outer regular. 


Let (Q,d) be a metric space and B be the Borel o-algebra 
generated by the metric topology. Furthermore, let jz be a Borel 
- Stieltjes measure on B. Is 4 o-finite? 


Show that a o-finite Borel measure 1 on (IR, B(R)), need not 
be outer regular. 


Let (R",B(R"),) be the Borel-Lebesgue measure space. 
Show that the Borel - Lebesgue measure is strongly regular. 


Let denote the Borel- Lebesgue measure on the Borel o- 
algebra BR"). Show that for each Borel set B and ¢€ > 0, 
there is a countable cover of B by disjoint half-open cubes 
{C;,} such that 


yo A°(Ce) — A(B) < €. 
In particular, 


A(B) = inf {S0—, A°(Cz): {Cu} € E(cubes)}. 


(Hint: Use Problem 3.19.) 
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3.21 


3.22 


3.23 


3.24 


3.25 


3.26 


3.27 


3.28 
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Show that if N is a negligible set in (R",B,A), for each 
€ > 0, there is a countable cover of N by disjoint semi-open 
cubes {C;,} such that 


ye (Ce <.E; 


Show that if NV is a subset of R”, and for each ¢ > 0, there is 
a countable cover of N by half-open (not necessarily disjoint) 
cubes such that 


Ye ee) <€E, 
then N is negligible. 


Show additivity of f in Theorem 3.10 for the other combi- 
nations of x and y. 


Let ys be a translation-invariant Borel measure on B(IR”) and 
let 4." be the outer measure produced by (B(R"), ju) and B7, be 


the corresponding o-algebra of j:*-measurable sets. Show that 
(@) =p = W(C)A* on P(R") and fy = W(C)AG on Br, where 
C’ is a unit cube. 


Gi) Br=Lr. 


Let (R”, B, X) be the Borel - Lebesgue measure space and let B 
be a compact set. Show that for each ¢« > 0, there is a finite 
cover of B by half-open rectangles D,,..., Dy such that 


SAD) 6 < MBy< ADD. (3.25) 


For any ¢ > 0, construct an open set D in (R,7,), which is 
dense in R and with A(D) < e. 


Is every o-finite Borel measure on B(R”) also a Borel- 
Lebesgue - Stieltjes measure? 


Show that there is a non-Borel set in P(R). 
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NEW TERMS: 


Ac, Lebesgue content, 313 

A*, Lebesgue outer measure, 316 

L*, Lebesgue o-algebra, 316 
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(R”, £*, A5), Lebesgue measure space, 316 

A = Resyys)A* = Resgcpr)A*, Borel - Lebesgue measure, 316 

Borel measure, 317 

(Q, B, w), Borel measure space, 317 

Borel - Lebesgue - Stieltjes measure, 317 

Borel outer measure, 317 

Lebesgue - Stieltjes o-algebra, 317 

Lebesgue - Stieltjes measure, 317 

Lebesgue - Stieltjes elementary content, 317 

Lebesgue - Stieltjes content, 318 

figure, 318 

distribution function, 320 

extended distribution function, 320 

®.|E: = {{f;u}:u € B}, quotient class of extended distribution 
functions, 321 

8,., finite Borel - Lebesgue - Stieltjes measures, 321 

translation-invariant Borel measure, 323 
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inner regular measure, 330 

closed regular measure, 330 

Radon measure, 330 
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strongly regular measure, 330 

measure of a hyperplane, 331 

non-Borel set, 334 
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4. IMAGE MEASURES 


In Remark 3.5(277) we saw how Borel - Lebesgue - Stieltjes measures 
can be generated by measurable functions belonging to the class of 
extended distribution functions D,. We begin this section by generat- 
ing measures by elements of C~'-class of measurable functions 
(more general than ,) and then illustrate such constructions in 
Euclidean spaces. The means of generation of measure here is totally 
different than those in Remark 3.5(zi7) and the two notions should 
not be confused with each other. Section 3, Chapter 4, is a relevant 
prerequisite to this material. 


4.1 Proposition. Let (Q, 2’, 41) be a measure space and let f : 
(Q, 37) > (Q!, ©) be a measurable function. Then the set function 
Al & puf*(A’) = p(f*(A’)) is a measure on 3". O 


(See Problem 4.1.) 


4.2 Definition and Notation. The measure j f* in Proposition 4.1 
induced by a measurable function f is called an image measure. 
Notice that directly from Definition 2.1(v227), Chapter 1, of the 
inverse image of a set, ju f*(A’) can alternatively be viewed as 
pi{w € Q: fw) € A’} or, in short, as p{f € A’}. O 


4.3 Proposition. Let L: R" — R" be such that L(x) = ax +b, 
where a € R\{0} and b € R". Then the Borel - Lebesgue measure 
on B(IR") has the property \L* = 4, . Specifically, if a =1 we 


— lal” 


have L* =X, which shows that the Borel - Lebesgue measure is 
translation-invariant. 


Proof. 


1) Let f(x) = ax (called a homothetic function), where x € R” 
and a( #0) €R. Let A be the Borel-Lebesgue measure on the 
Borel o-algebra 6. We first show that 


Aft= a 


on S. Take (a, b] € S. Then, 
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which implies that 


Af*((a,6]) = 4+ A((a,b]) fora > 0 
and 
Af*((a, 6]) = 4(-1)"A((a, b]) fora < 0, 


Qn 


and thus 


As a continuous map relative to the usual topology, f is Borel and, 
thus by Proposition 4.1, Af* is a Borel measure on B. Obviously, 
a> is also a Borel measure on B. Because A f* and a coincide 
on S and clearly are o-finite and o-additive on S, by Corollary 2.14, 
they should also coincide on B. 


2) Let g(x) = «+b. Similarly, we can show (see Problem 4.2) 
that Ag* is a Borel measure on G and furthermore that Ag* = X. 
Therefore, 2 is translation-invariant. Finally, 


L= 907 ond AL =Ay cg" = ar ag = ar 


1 
la 


The proposition is therefore proved. O 


4.4 Remark. From Proposition 4.3 we learn that the Borel - 
Lebesgue measure is invariant under translation (a particular case of 
motion) defined as T,(2)=a+a. In the two-dimensional 
Euclidean space we know another form of the motion, called 
rotation. A figure under rotation R and subsequent translation T;, is 
transformed into a congruent one. We will show that an arbitrary 
Borel set in R” “rotated” and then translated preserves its volume 
[Dsh2]. 


In the n-dimensional Euclidean space, in place of rotation, we 
use an orthogonal transformation. More precisely, in IR” an ortho- 
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gonal transformation is in the form of an orthogonal n x n matrix; 
recall that an nxn nonsingular matrix A is orthogonal if 
AA'=A'TA=I (the identity matrix). The composition M = 
T,,0 A is an example of a motion. Generally, a bijective map M 
from one metric space (X,d) onto another metric space (Y,p) is 
called an isometry if it preserves the distance, that is, if for every pair 
x,yEeX, d(x,y) = p(M(x), M(y)). Two such metric spaces are 
said to be isometric. A motion is thus a special case of isometry 
when Y = X, p =d. Indeed, it is well-known that an orthogonal 
matrix A preserves the distance (||A(zx)||, = ||Azl|, = |lzll., 
x € IR”) and so does the translation. In the Euclidean space (R", d.), 
a motion M can be represented by the composition T,, o A, where 
T,, is a translation map and A is an orthogonal matrix. As a 
continuous map, the motion is also Borel. It can be shown (see 
Problem 4.10) that the Borel - Lebesgue measure is motion-invariant, 
that is, AM* = 2X. LC 


4.5 Examples. 


(i) Let (Q, 3’, 4) be a probability space and let (’, ©”) be a 
measure space. Then any 4’-»’-measurable function f: Q — 1’ is 
called a random variable (in short, r.v.). The corresponding image 
measure yf* is called the probability distribution (of the r.v. f). 
Observe that in probability theory, a probability measure is denoted 
by P and a random variable is denoted by uppercase letters such as 
X,Y, or Z. In most applications, 9’ is the numeric set R” or a 
subset of R”, and X” is the corresponding Borel o-algebra B(R") or 
its trace on the subset. We emphasize that a r.v., say X, is not just a 
measurable function. It can only be a tr.v. if it is associated with a 
particular probability measure P, along with which it induces the 
probability distribution. The latter then specifies the r.v.. In other 
words, measurable functions may share the same measurable space, 
but as far as probability theory goes, they differ if they generate 
different probability distributions (or more precisely, different 
classes of probability distributions categorized by their parameters). 
Furthermore, as we show later, r.v.'s that induce the same probability 
distribution may still differ, but they form equivalence classes. 
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(it) Let (Q,2’,P) be a probability space and X:Q— {0,1, 
.. } bear.v. such that P.X™* is a Poisson measure with parameter a: 


PX Say =) oe ens @ > O: 


n! 


Then the r.v. X is called a Poisson r.v.. Similarly, a rv. X: 
Q — {0,...,n} is called binomial, if PX* is the binomial measure 


= Bap = a aie ') pt (1 — p)"*ex, PE (0, 1). 


A rv. X is called (discrete) uniformly distributed if PX* = 

r-1-ék- AS pointed out in (i), X: Q— {0,...,n} is just a 
measurable function (which can be uniform or Ginonial), and it 
becomes a r.v. upon specification of its distribution PX* or even 
prior to this, the probability measure P. 


These are examples of so-called discrete r.v.'s. The construction 
of probability distributions of continuous t.v.'s (i.e., those whose 
ranges are continuums) requires integration and the concept of a 
density. The latter is developed in Chapter 6. 


(iz) Let (Q, 2’, P) be a probability space and X; be an (Q, »’)- 
(Q;, 57;) rv., i= 1,...,n. Now, each of the n r.v.'s generates a o- 
algebra, o(X;): = X}*(2;) C &. The r.v.'s X1,..., Xp are said to 
be independent if the o-algebras {o(X1),...,7(X,)} are indepen- 
dent. (For more details on independent families of events, see Prob- 
lems 1.18 - 1.22 of this chapter. O 


PROBLEMS 


4.1 Prove Proposition 4.1. Let (Q, 5’, w) be a measure space and 
let f: (Q, 3) > (Q', &”) be a measurable function. Then the 
set function A’ > wf*(A’) = p(f*(A’)) is a measure on &”". 


4.2 Prove part 2 of Proposition 4.3. The Borel - Lebesgue measure 
A on B(R") is translation-invariant. 


4.3 Let (Q, X’, 4) be a measure space with =R, © = {A CR: 
either A or A° <N} and let (A) =0 for AXN and 
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4.4 


4.5 


4.6 


4.7 


4.8 


4.9 
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p(A)=1 for A°TXN. Let O'=R, &’=B(R). Define 


0, if x is rational 
iF if x is irrational. 


Prove that f is 3/-»/’-measurable and determine pu f*. 


What are the traces of Borel c-algebras B(R) on 0! = 
{0,1,... } and Q! = {0,1,...,n} introduced in Example 4.5 
(it)? 


Let M(R”) be the collection of all motions on (R”, d.). Show 
that (M(R"), 0), where o is the composition operator, 
forms a group. 


Let f be a homothetic function (f(x) =azx) defined in 
Proposition 4.3, part 1, \ be the Borel- Lebesgue measure on 
BIR"), A* be the Lebesgue outer measure, L* be the c-algebra 
of Lebesgue measurable sets, and \;, be the Lebesgue measure 
on £*. Let y* be the outer measure generated by the image 
measure Af*, Bi be the o-algebra of all *-measurable sets, 
and ju, = Resg; j". Show that: 


a) p* = jal "A* on P(R"). 
b) pi = |a| "Aj on B*. 
c) BR = L*. 


Generalize Problem 4.6 by letting f be an affine map f(x) = 
axz+b,a#0, b€ R”. 


Show that the Lebesgue measure Aj on L* is translation-in- 
variant. 


Let yj be a translation-invariant Borel measure on B(R”) and 
let p.* be the outer measure produced by (6(R"), ju). Show that: 
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a) u* = pw(C)A*, where C' is the unit cube. 
b) Bi = L*. 


4.10 Let (Q,2°,P) be a probability space and X; be a (, 2)- 


4.11 


(Q;, 54) rv., i =1,...,n. Show that X,,...,X,, are indepen- 
dent if and only if c-algebras »’;'s have / -stable generators G; 
such that the system {Xj7*(Gi),...,X7"(Gn)} of families of 
events are independent. 


We know from probability theory that r.v.'s Xy,...,Xn 
defined on a common probability space (, 3’, P) and valued 
in ()4,...,Qn, endowed with o-algebras ¥,...,5’,, are 
independent, if for any n-tuple of measurable sets A;,..., An, 
P{Xy € Al, aa Xn € An} = {X1 = Aj}. Pi An € Ant 
How do you reconcile this version of independence with that in 
Example 4.5 (iii)? 
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5. EXTENDED REAL-VALUED MEASURABLE 
FUNCTIONS 


5.1 Definitions and Notations. 


(i) Recall (Section 3, Chapter 4) that C-1(Q, 27;0’, X”’, ) 
denotes the collection of all measurable functions from a measurable 
space (Q, 2’) to a measurable space ()’, %”). If Q’=R and 
+” = B(R), then 


C71(0, 2; R) =C71(0, 2; R, B(R)) 


will denote the class of all real-valued measurable functions on a 
measurable space (Q, »’). The class of all complex-valued meas- 
urable functions is denoted by 


C-1(9, 5;C) = C-1(9, 3; C, BC). 


Using the notion of product measures (Section 9, Chapter 6) we can 
show that a function f = u+iv €C7!(Q, X;C) if and only if u,v 
EC 1(Q, 2; R). 

(ii) The “extended real line R” that we utilize in this and all 


forthcoming chapters is formed by a “two-points compactification” 
procedure whose formalism here is reduced to a minimum. 


Firstly, we adhere {+00} and {—oo} to R arriving at the 
extended real line. [See also Example 1.2(iv).] Then, we form the 
collections of sets [—oo, b) and (a, oo] as a subbase for 7-. 


The intersections of such sets produce open intervals (a,b), 
which, along with [—oo, a) and (b, co], according to Theorem 2.8, 
Chapter 3, form a base for the topology that we denoted by 7,. This 
topology obviously includes all open sets of 7, and in addition, open 
sets of types [—o00, b) and (a, oo}. 

(iti) Now, ( —oo,b) and (a,oo) are open sets in 7,. It can be 
readily deduced that the sets [—00, 00) and (—o0, ov] are also open 
and thus the singletons { —co} and {oo} are closed which comes to 
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no surprise, since (R,7,) is Hausdorff where all singletons are 
closed sets. 


We also notice that closed sets in 7, need not be closed in 7,. For 
example, [a,0oo) is a closed set in 7. but not closed in 7,. This is 
because {co} is a closure point of [a, 00) (because each open neigh- 
borhood (c, oo] of {co} meets [a, co). 


(iv) Now, R is compact; firstly, because the spaces (F, 7. F), 
where F=[—%,3], and (R,7-) are homeomorphic through 
[F’, IR, Res;tan]. Secondly, we can see that any open cover {O;} of R 
must include open sets [—co,c) and (d, oo] to cover { —oo} and 
{oo}, respectively. If so, then the set [c,d] (without loss of 
generality, c < d) can be covered by {O;} excluding [—oo, c) and 
(d, co]. Because [c, d] is compact in 7,, the rest of the cover by {O;} 
is finitely reducible, thereby making |—o0o, c) and (d, oo], along with 


a finite subcover of {O;}, a finite subcover R. 


Like (R,7-), the compactified space (R,7.) is second countable 
and separable, with Q being the same dense subset. 


We notice that the above formalism of (IR, 7.) can be more rigor- 
ously elaborated, but the above is sufficient for the forthcoming 
applications. 


(v) The corresponding Borel o-algebra B(R) = X7(7,), there- 
fore, consists of all sets of B(IR) and combinations of unions of Borel 
sets with closed sets {++oo} and {—oo}. 


In this section, we are concerned with the class of all extended 


real-valued functions f: 2— R that are %-B(IR)-measurable, 
where »’ is a o-algebra in 12. We denote such a class by 


C-1(0, £;R) =C-1(0, £;R, BR) 
(or sometimes in short by C7! if a measurable space (©, 2’) is pre- 
viously specified). O 


We begin with a simple criterion for measurability of C~‘-func- 
tions. 
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5.2 Proposition. 4 function f € C~' is measurable if and only if, 
for every real value a, the set {w € 0: f(w) < a} = f*([—~, a]), 
in notation { f < a}, is measurable, that is, is an element of »’. 


Proof. We show that the collection of sets {[—oo, a]: a € R} isa 


generator of B(R). Then the statement follows directly from Propo- 
sition 3.4, Chapter 4. 


Denote B’ = 7({[—00, a]: a € R}). Then, 


(a, b] = [—o0, b]\[—00, a] € B’ 


and in as much as S C 6’, it immediately implies that B(IR) C B’. 
Because 


ane = 11h +00 snd mee = [-se ach: 


we have that {+00}, {—oo} € B’. Thus B(R) C B’. On the other 
hand, 


wz 


[— 00, a] = {—00} U (—00, a) U {a} € BR). 
Therefore, {{—o0,a]: a € R} C B(R), which yields that B’ C 
B(R) and, finally, B’ = B(R). oO 

Proposition 5.2 can be extended to a number of modifications of 
conditions equivalent to measurability. 


5.3 Corollary. A function f € C~! is measurable if and only if 
any of the following conditions holds. 


(i) {f ease, VaeR. 
(it) {f >a} eX, VaeR. 
(ii) {f <a} EX, VaeR. 


Proof. (777) 
{f <a} = f*([—00, a)) 
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ar U [-co, a— 1)) =U f*([=,4= *)) 


n=1 


=Utf<a-1} 


n=1 


ie., {f <b} © XY, Vb ER, and thus {f < a} € &. Similarly, 


(Fsa}=s| A\l-ca+4)| 


n=1 
oo 


=f) f*([-c00,a + 4)) = ff <at4}, 


n=1 n=1 


that is, {f < b} € 3’, Vb € R implies that {f < a}. 


Therefore, the statements, {f < a} € X’, Va ER, and {f < a}, 
Va € R, are equivalent, which along with Proposition 5.2, yield 
statement (i77). O 


[For the proof of (i) and (iz), see Problem 5.1.] 
5.4 Proposition. Let f,g © C7’. Then{f <g} eX. 
Proof. We show that 


{f <g}= Uti srinitg> ri 


by using the pick-a-point process. We exclude the trivial case when 
{f <g} =@. If wo € {f(w) < g(w)}, then equivalently, f(wo) < 
g(wo), implying that there exists an ro € Q such that f(wo) < ro < 
g(wo). Hence, 


; wo € {f(w) < ro} N {g(w) > ro} 


EPS 


Conversely, if 
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wo € Uti <rintg> ri, 


then there exists an ro €Q such that wo € {f(w) <ro}N 
{g(w > ro} and f(wo) < ro < g(wo), implying that f(wo) < g(wo). 
Thus wo € {f(w) < g(w)}. Now the statement follows from Propo- 
sition 5.3 (77, iz). O 

5.5 Definitions. On some occasions we deal with spaces of meas- 
urable functions that have not occurred before. We discuss the fol- 
lowing constructions. 


(2) Let F be a field and let VY be a vector lattice over F and a 
commutative ring with unity. Observe this makes (4, F) an algebra 
and (¥, - ) a multiplicative Abelian semigroup with unity; call it for 
short an ¥-space over F. Throughout the remainder of this book, as 
an -space, we consider a class of functions (extended real- or 
complex-valued over the field R or C). For instance, the space of all 
continuous functions is an ¥-space over R. (Note that the term 1- 
space is not common in real analysis literature and is restricted to the 
use in this book.) 


(iz) Let R” be the set of all functions from 2 to R (as we de- 


fined it in Section 5, Chapter 1), and let (R”, Tp) be the topology of 
pointwise convergence (cf. Definition 5.12, Chapter 3) generated by 


the compact topology (IR, 7.) in each of the factor spaces. Let us call 


(R®, Tp) the extended topology of pointwise convergence. Let Y be a 


subset of (R”, Tp) such that it is an V-space over R. We call Y an I- 
X-space if (Y, 7p) contains its all sequential limits, that is, the limits 
of all Tp-convergent sequences. (Notice that an /-V-space need not 
be closed.) In other words, it contains the limit of every pointwise 
convergent sequence of functions. Observe that because by Problem 


5.17, Chapter 3, (R”, Tp) is Hausdorff and Hausdorffness is heredi- 
tary, any pointwise limit in a subspace of (R”, Tp) is unique. For in- 
stance, the space of all continuous functions is not an [-4-space. 


(iit) Consider the subspace (C~!, tp) C (R®, Tp) of all measur- 
able functions structured in terms of the extended topology of point- 
wise convergence. The theorem below states that (C~', rp) is an I- 


348 CHAPTER 5. MEASURES 


X-space and it is the widest known class of functions (second to 
R”). Oo 

5.6 Proposition. (C~!, tp) is an I-X-space over R; that is, for 
any f,g EC * and for {f= 1,2,4..)CC 

(i) af+beECc!(a,beER). 

(ii) ftgec. 

(iii) f-gec. 

(iv) sup{fn} € (C7', tp) and inf{f,} € (C7 , rp); specifical- 
ly, it follows that C~' is a lattice, and thus with any f € C~', also 
lf ect. 


(v) lim fn = infr=i2,..sup{ fm: m > n} € (C7, tp). 


lim f;, — SUP pa1-9:.,. INET Jen? m = n} = Cc tp). 


(vi) If fn — f in the extended topology of pointwise conver- 
gence, then f € (C~',Tp). 


(vii) The space (C~', rp) is Hausdorff. 
Proof. 
(2) Is obvious. 


(ii) By (i), a—geC implying that {f+g<a}={f < 
g — a}. Therefore, by Problem 5.2 (i), f +g € C7. 


(iii) Firstly, {f? > a} =, ifa < 0. Then 
{f?>a}={f> Ja} U{f< — Ja}, fora>0, 


implying that {f? >a}e XY = f? €C!. The statement follows 
from the representation f - g = $(f + 9)? — 4(f — 4g)’. 
(iv) We show that 


{sup{fu} <a} = {fu <a}. 
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wo € {sup{f,} < a} if and only if sup{f,,(wo)} < a or equivalently 
fi(wo) = a, f2(wo) < A,..-, FUpe {fi(w) < a}, i= de Aeesag 


or, equivalently, w) € () {fn(w) < a}. The latter implies that 
1 


n= 


{sup{ fn} < a} = {fn <a}. 


n=1 


Let {fn} C C7. Then { — f,} C C7. The statement follows from 


inf{f,} = —sup{ — fr}. 


Now if f € C7!, it implies that |f| = sup{f, — f} € Col. 

(v) This statement follows directly from (iv). 

(vi) Jim. fn = f if and only if lim f, = lim f, = f, and the 
statement follows from (v). 


(vii) See Problem 5.9. O 


In the sequel, we need the notion of “properties that hold almost 
everywhere.” 


5.7 Definitions and Remarks. 


(i) Let (Q, 3’, 4) be a measure space. A property II (of points 
of Q) is said to hold almost everywhere (a.e.) or p-almost every- 
where (u-a.e.) if there is a (ji-null) set N € NV, (see Definition 
2.5(2), Chapter 5) such that IT holds for all points of N°. Notice that 
this definition does not preclude property II to hold on N or on its 
subset. It merely says that II may possibly fail on a negligible subset 
of N. 


(22) Two functions f and g are said to be equal a.e. if f = g on 
the complement of a p-null set N. In this case we say that f = g 
(mod jz). In the event f and g are measurable, by Problem 5.2 (izz), 
the set {f # g} is measurable. Therefore, with f = g a.e., the set 
{f Fg} © Nu ie., is p-null. 
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In general, however, if f = g (mod jz), where at least one of the 
functions is not measurable, { f 4 g} need not be a null set, although 
it is negligible, and thus { f 4 g} is a subset of a ji-null set. 


(iti) Let C~'(Q, 7; R) be the set of all measurable functions on 
Q and let 4 be a measure on %’. Let [f],, denote the set of all 
functions that are pairwise equal ji-a.e. on {2. Specifically, [0],, 
denotes the set of all measurable functions, which equal zero pU-a.e. 
on (2. Clearly, the p-almost everywhere property of equality of 
functions induces an equivalence relation (say E) on the set 
C-1(Q, 7; R). Then 


Che R)I,, »=C*(Q,8-R)|, (5.7) 


denotes the quotient set {[f],,: f € C7'\(Q, ©; R)} and it is called 
the quotient set modulo ,1. In light of these considerations, any two 
functions f and g such that f = g p-a.e. on 2 are also said to be 
equal modulo js and we write f = g (mod y), or f € [g],,, or equiva- 


lently, f —g € [0],. 


(iv) Finally, sometimes we use the notation [0}* = [0),9 cy, 


as the subset of all nonnegative measurable functions that equal zero 
ji-a.e., that we denote 

cyt =c1(Q, x, R,, B(R,)). (5.7a) 

O 


PROBLEMS 
5.1 Prove Corollary 5.3(2) and (72). 


5.2 Prove that for f,g EC", 
GO) 4fsgPe x. 
(ii) {f=gheX. 
(iit) {f Ag} eX. 
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5.3 


5.4 


5.5 


5.6 


3.7 


5.8 


a9 


5.10 


5.11 


5.12 


Let f,g € C-'. Show that w +> cos(f?(w) + 49(w)) € Ct. 


Show that if f? ¢ C7! then f € C7!. Show that if f? € C7! 
then f need not be in C"!. 


Let f,g € C-! and let A € ’. Show that 


h=flatgla €Ct. 


Let f:(a,b] + R be a) a monotone function, b) a convex 
function, or c) a function with at most countably many discon- 
tinuities. Show that in each case, f is B((a, b])-B(R)-measur- 
able. 


Prove the statement. f € C7! if and only if {f > q} € XY for 
alg EQ. 


Show that if f has a derivative at each point of R, then this 
derivative is Borel-measurable. 


Show that the space (C~!, rp) is Hausdorff. 


Let (Q,2’,4) be a complete measure space and let f € 
C-1(Q, X;R). Suppose that g: Q > R is an extended, real- 
valued function. Show that if g= jf (mod ju), then g€ 


C(O se), 


Let (Q,2',) be a complete measure space and _ let 
{fn} CC1(Q, Y;R). Suppose that lim, f, exists and 
fn — f pointwise pi-a.e. on 2, where f is an extended, real- 
valued function. Show that f € C!. 


Let Q be a non-Borel subset of R (such as the one in Problem 
3.19) and let C’ denote the Cantor ternary set. Define the 
function 


f(a) = leno(@)sinx + 1eney(@)2”. 
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5.13 


5.14 
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Is f Lebesgue measurable, that is, f € C~1(IR, L*, A)? 


We knew from probability theory that r.v.'s X1,...,X, ona 


probability space (Q, X’,P) and valued in R (each), endowed 
with the Borel o-algebra B(IR), are independent if and only if 
their joint probability distribution function (commonly abbre- 
viated as the PDF) P{X, < 2,..., Xp, < %,} equals the pro- 


duct [] P{X; < x;} of their marginal PDF's. Prove that this is 


i=1 


true using the original definition in Example 4.5 (777). 


Prove the statement: f € C7! if and only if {f > d} € ¥ for 


alld € D, where D C 


R is any dense set in R. 
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NEW TERMS: 


C(O, ©;R) = C1(Q, ©; R, B(R)), class of all real-valued 
measurable functions, 343 

c1(Q, ©; C) = C-1(Q, ©; C, B(C)), class of all complex-valued 
measurable functions, 343 

extended real line, 343 

two-points compactification, 343 

C1(Q, YR) = C(O, Y; R, B(R)), class of extended real-valued 
measurable functions, 344 

measurability of an extended real-valued function, criteria of, 345 

X-space, 347 

(R®, Tp), extended topology of pointwise convergence, 347 

I-V-space, 347 

ji-a.e. property, 349 

a.e. property, 349 

C-'(Q, ©; R)|,,, quotient set modulo ju, 350 

[0],,, set of all measurable functions equal zero p-a.e., 350 

[oy = [0], Cz, 350 

Cy" — Co, Di, R, , B(R )), 350 

independence criterion of numeric random variables, 352 
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6. SIMPLE FUNCTIONS 


The present section is a direct precursor to integration, which we 
develop in the next chapter. The integral itself is first defined for 
simple functions valued in a finite set of nonnegative reals. 


6.1 Definition. We consider the following subclass of functions 
from C~1(Q, 7; R), which we call nonnegative simple functions and 
denote this subclass by W.(Q, X’) = W.(Q, ©; R). An element s is 
said to belong to W, or to be nonnegative simple if: 


aj)s EC}, 
b) s(w) > 0, Ww EQ. 
c) s takes on only finitely many real values. O 


6.2 Remarks. 


(i) Lets € W,(Q, »’). If there is an n-tuple of nonnegative real 
numbers {a1,...,@,} and a finite decomposition )>/_, A; of Q such 
that s(w) = a,x for all w € Ax, then the function s can obviously be 
represented as 


s= >) aera. (6.2) 


(See Figure 6.1.) 


a, 
a, 
a, 
A 
A > he 


Figure 6.1 
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In the forthcoming constructions we deal with different partitions of 
Q. Consequently, there are different finite representations or 
expansions of s € W, of type (6.1). However, there is obviously a 
unique one where (6.2) contains all different values {a,,...,a,} of 
s. We call such a representation (expansion) canonic. 


(ii) We call the pair (V,F +) a semi-¥-space if it meets all 
specifications in Definition 5.5(i), except that 4 is defined over a 
semifield F. Recall that F, C F is called the semifield if all axioms 
of the field hold except for No. 4 (the existence of additive inverses; 
see Definition 7.5, Chapter 1). 


(277) In [Dsh2], we also use the notion of a simple function, 
which is just as in Definition 6.1, except that they are not necessarily 
nonnegative. The set of all such simple functions is denoted by 
w(O, 3’) = 0(O, 2; R). O 

6.3 Proposition. (W.(Q, 5’); Ry; -) is a semi-X-space. In other 
words, if s,t © W,, then: 

(i) as+btEwW, (abe 


(it) s-tTE W,. 


wz 
bl 


(iti) sup{s, t} © Wy. 
(iv) inf{s,t} € WY. O 
(See Problem 6.1.) 


We denote by (W,.(Q, ©), Tp) the subspace of all extended, real- 
valued, nonnegative functions f € C! for each of which there exists 
a monotone nondecreasing sequence {s,,} CW, of nonnegative 
simple functions such that f = sup{s,} in Tp. By Proposition 5.5 
(iv), Y, CC, that is, YW, consists of only measurable functions. 
The theorem below asserts that the set of all nonnegative measurable 
functions C;! coincides with W,. 


6.4 Theorem. Let C~': =C'(Q,X;R,), ie, the subclass of 
all nonnegative extended real-valued functions. Then for every 
fe Cx there is a monotone nondecreasing sequence {s,,} of non- 
es pee ae functions from VY, such that sup{s,} = f, that is, 
c;! 
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Proof. As already mentioned, van C Gs Therefore, we are left to 
prove that C;' C W,. We show that, for every f © C;', there is a 
monotone nondecreasing sequence {s,,} of nonnegative simple 
functions from Y, such that sup{s,,} = f. The latter is at the heart of 


the following construction. Let 


{f>e}a{f< #2}, 1=0,1,...,n2"-1 


For instance, 
Ao(n) = {f > O}N{F < a} = f°((0,2)), 
Ai(n) ={f > mf} OLE S a} = f((2,-%, 2"). 
In other words, 


Aj(n) = f*((#, $4), i=0,...,n2"-1, 


and 
Ain) = 7" ((n, 0d) ),.4 = 72". 


Therefore, all sets A;(n), 1 =0,...,n2”, are disjoint and obviously 
>/-measurable. Let us define 
n2” ; 
Sn(w) = D2 Lan w) (€ W,). 


4= 


Both f and s,, are depicted in Figure 6.2. 
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Figure 6.2 


3 ¢ 
N 
sas" & 
7 So 
NO 
+3 
Bae 
ae 
= 8 
v | 
oo 
oS 
VIG 


Clearly 5,41 > S,. In addition, s,,(w 
EQ: fw) <n, and f(w) >n, Vw 


and s,+; are drawn in Figure 6.3. 


Figure 6.3 
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Thus there exists sup{s,} = f (pointwise Vw € (2), and therefore 
f © W,, implying that Cz! CW. 
This proves that C;! = W,. oO 


Remark 6.5. Theorem 6.4 can be interpreted as follows. For 
each f € C there is a sequence {s,} C W, such that s,, — f in 
Tp. (That the sequence is monotone increasing is essential only for 
the existence of such a sequence through an explicit construction.) 
By Theorem 5.13, Chapter 3, it means that for each base neighbor- 
hood By, there is an N(By)-tail of {s,,}. Consequently, each By 
meets W, and hence, f € W,, that is, f belongs to the closure of YW, 
in the sense of the Tychonov topology tp. We shows that C;' C W,. 


To prove Y, C Cz! requires that C;' be first countable, which 


C;' would inherit from first countability of R° should the latter be 


first countable to begin with. Unfortunately R’ is not first countable, 
unless (2 is at most countable (a not very practical special case), as 
we learn from Problems 5.18 and 5.20, Chapter 3. First countability 
would provide a countable monotone decreasing sequence of base 
neighborhoods, from which we could select a sequence {s,,} con- 
vergent to a function arbitrarily picked out from the closure V,. 
This would prove that y is measurable and thus ¥, C C;'. Then 
WY = C,' would be a desirable property that C;' and thus C~' were 
topologically closed. 


These all remain wishful thinking though. Recall that in Remark 
1.15(zi), Chapter 3, we brought up the case of the cocountable topo- 
logical space (being Hausdorff and not first countable), in which a 
set contained all of its sequential limit points and yet failed to be 
closed. As the result of the above considerations, C;' remains a 
proper subset of Y (unless 2 is countable). Obviously, C;' is not 


closed in Tp, because if it were, we would have YW, C ‘te = Ge 
(which is absurd) for ¥ C C;". O 


6.6 Notation. Now we further enlarge the class of measurable 
functions by considering arbitrary extended, real-valued, measurable 
functions of C~!(Q, 2’). For each f € C7! and 0, being the function 
identically equal to zero on (22, denote 
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f* =sup{f,0} and f~ = —inf{f,0) = (—f)* 


(cf. Definition 7.7, Chapter 1). Clearly (see also Problem 7.15, 
Chapter 1), 


fr 20, f°  20,f=f"-—f, and |fl=fr+ Ff. 


By Proposition 6.4, f* and f~ are also elements of C-! (more 
precisely, elements of C;') if and only if f € C~!. We revisit this in 
Chapter 6, Section 1, regarding integration. O 


PROBLEMS 


6.1 


6.2 


6.3 


6.4 


6.5 


Prove Proposition 6.3. 


Let 2 be an uncountable set and let Y = {A CQ: A or A® is 
at most countable}. Show that f € C~1(Q, 2’) if and only if f 
is constant everywhere except on an at most countable subset 
of 22. 


Prove that f = g (mod y) if and only if f* = gt (mod ju) and 
f =g (mod 4). 


Show that f € [0],, if and only if f*, f~ € [0] 
= (9), M Cy. 


+, where [0|* 


pe? Lt 


Theorem 6.4 states that for each f € C.. there is a monotone 
nondecreasing sequence {s,,} of nonnegative simple functions 
(from W,) such that sup{s,,} = f. In Remark 6.5, it is shown 
that unless C;' is first countable in the sense of tp (which in 
most applications it is not), C;' is a proper subset of W, (the 
Tp-closure of W,). Does it mean that the converse of Theorem 
6.4 does not hold [i.e., that, although sup{s,,} € YW, of some 
sequence {s,,} of simple functions, it need not belong to C;'|? 
Analyze and explain this. 
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NEW TERMS: 


W.(Q, 3’) = W,(O, 37; R), nonnegative simple functions, 354 

canonic representation (expansion), 355 

canonic expansion (representation), 355 

(V,F,), semi-4-space, 355 

W(Q, ©’) = W(Q, 2’; R), simple functions, 355 

(W,(Q, ©), Tp), extended, real-valued, nonnegative functions 
and their limits, 355 

f* = sup{f, 0}, 359 

f- = —inf{f,0), 359 


Chapter 6 


Integration in Abstract Spaces 


The historical significance of the development of measure theory is 
that it created a base for a generalization of the classical Riemann 
notion of the definite integral (which since 1854 has been considered 
to be the most general theory of integration). Riemann defined a 
bounded function over an interval [a,b] to be integrable if and only 
if the Darboux (or Cauchy) sums 5>""_, f(t;))AU;), where )>""_, Ji, is 
a finite decomposition of [a, b] into subintervals, approach a unique 
limiting value whenever the length of the largest subinterval goes to 
zero. The French mathematician, Henri Lebesgue (1875 - 1941), 
assumed that the above subintervals J; may be substituted by more 
general measurable sets and, in addition, that the class of Riemann 
integrable functions can be enlarged to the class of measurable 
functions. In this case, we arrive at a more advanced theory of inte- 
gration, which is better suited for dealing with various limit proces- 
ses and which led to the contemporary theory of probability and sto- 
chastic processes. 


Although many results existed prior to Lebesgue's major work 
between 1901 and 1910, Lebesgue's construction appeared to be the 
most efficient. After 1910, a large number of mathematicians began 
to engage in work initiated by Lebesgue. Some of the most signifi- 
cant contributions were made by the Frenchman Pierre Fatou 
(1878 — 1929), Italian Guido Fubini (1897 - 1943), Hungarian 
Frigyes Riesz (1880 — 1956), Pole Otto Nikodym (1887 — 1974), and 
the Austrian Johann Radon (1887 - 1956) who introduced the 
Lebesgue - Stieltjes integral and whose work led to the modern 
abstract theory of measure and integration. 


J.H. Dshalalow, Foundations of Abstract Analysis, 361 
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We begin this chapter with conventional integration relative to 
positive measures using the same techniques as those for the original 
Lebesgue integral (but with no sacrifice to generality). It is followed 
by applications of the integral (such as those to probability and to the 
relationship with the Riemann integral in Euclidean spaces). We note 
that the classes of integrable functions we are concerned with have 
no assumptions on the topological structure of sets (that is, carriers) 
on which they are defined. Furthermore, we introduce integrals of 
these functions also with respect to signed and complex measures. 
These features warrant the title of this chapter. (In the literature on 
measure theory, abstract integration is sometimes used in the con- 
text of more specialized or structured spaces.) We then further 
proceed with absolute continuity of measures (including signed and 
complex measures) and multiple integration. 


1. INTEGRATION ON C-1(Q, 5) 


We begin the theory of integration with integrals of nonnegative 
simple functions, which we introduced in section 6, Chapter 5. Prior 
to the definition of the rudimentary integral, the proposition below 
states that integrals of nonnegative simple functions are invariant of 
their representations. 


1.1 Lemma. Let (Q,2’,1) be a measure space and let 
s € W,(Q, X’) have two representations: 


8 = Dijn1 Gila, = Dop-1de12,,. 
Then it holds that 


7-1 ti (Ai) = 4-1 0k (Br). 


Proof. The above representations are due to the two decomposi- 
tions of (2: 


Q= ia Ai = pai Bk- 
Then 
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U(Ai) = Oe (Ain Be) and w( Br) = 3 u(Ai Nn Bx), 


which implies that 


ae {H(A;) = ae (2 aip( A; By) 


and 


pa Ok Be) = dopa din bee Ai M Br). 


By noticing that a; = by on A; N By, we are done with the proof. O 


1.2 Definition. Let (2, 2’,j4) be a measure space and let 
s € W,(Q, »’) with the representation 


nm 
8 = D141 4,. 


Then the number 


a1 GAs) 


is called the integral of s with respect to 4, and it is denoted by one 
of the symbols: 


Js(w)du(w) or fs(w)u(dw) or, for short, fsdpu. oO 


The value of the integral of a function s does not depend upon its 
representation. Consequently, by Lemma 1.1, this definition is con- 
sistent. In other words, the integral s ++ [sd defines a functional 
on W, valued in R. 


1.3 Proposition (Properties of the integral). 
(i) For each measurable set A € %), 


Js du = pA). 


Al 


(ii) The integral [| is a linear functional valued in R,., i.e., 
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J(as + bt) du =afsdut bftdp, 


where s,t € VW, anda,b € R, 


(iii) For any two nonnegative simple functions, s,t € Y,, such 
that s < t, it holds that 


J sdu < [tdp (monotonicity). oO 


(See Problem 1.1.) 


1.4 Example. Let f be the Dirichlet function defined as f = 1g 
(earlier introduced in Example 4.7, Chapter 2), where Q is the set of 
all rational numbers (hence a Borel set). Thus f € Y(R, 8). By 
Proposition 1.3 (2), the integral of f with respect to the Lebesgue 
measure is 


figdd = d(Q) = D@) = 0. Oo 


For the upcoming definitions and statements we denote a mono- 
tone nondecreasing sequence of functions by { f;,} and a monotone 
nonincreasing sequence of functions by {f,} | . 


1.5 Lemma. Let {s,}? CW, and s€W, such that s< 
sup{s,,}. Then 


fsdu < sup{ fsndy}. 


Proof. Let s = )*" ,a;1,4, and let « > 0 be any small number. 
Denote 


By, = {w: 8, > (1 —«)s} (€ &). 
(s,ae> (12) alg dp t= 1,2 ini 
By the definition of {B,,}, it follows that {B,,} 7 0, which implies 


that {A;1B,}f A,;. Therefore, by continuity from below of ju 
(Lemma 1.6, Chapter 5), 
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Js du = iG A;) = ini jim, uA; A Bn) 


n+ 00 
The last equation is due to the relationship 
slp, = 1p, Aida, = 204214: Laine,.- 
Thus, 
sup{{sndy)} =lim, fsndu 


> (1—e) Jim, fsizp,du=(1—e)fsdp, 


which proves the statement because the inequality holds for each 
E> 0. O 


1.6 Corollary. For {s,}1, {tr}? CW, such that sup{s,} = 
sup{t,}, it holds that 


sup{ f s,dj} = sup{ [t,du}. oO 


(See Problem 1.2.) 


Let us now turn to the integral of the functions from the more 
general class C;' = C~'(Q, ©; R) which we became more familiar 
with in Theorem 6.4, Chapter 5. 


1.7 Definition. Let (, 2’, 2) be a measure space and let f € 
C;,'. By Theorem 6.4, Chapter 5, there is a monotone nondecreasing 
sequence {s,,}? CW, such that f = sup{s,,}. Hence, it is plausible 
to define 


JS fd = supf f snd} 


and call it the integral of (an extended, real-valued, nonnegative 
function) f with respect to measure |. By Corollary 1.6, the value of 
the integral { fd is unique. O 
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1.8 Proposition. The integral introduced in Definition 1.7 is a 
positive, linear, monotone nondecreasing functional on C;'. 


Proof. Let f,g € Cz! and a,b € R,. Then 


ti _ SUP{Sn}, g= sup{t, } and af 7 bg = sup{as,, - btn} 
yield that 


S(af + bg)dp = sup{ { (asp + btr)du}, 
which, by Proposition 1.3 (iz), equals 
sup{af sndyu + bf tnd} 
= asup{ fs,du}+ bsup{ [tndu} = af fdy+ bf gdy. 
Now let f < g. Then we have: 
sups,, < supt,; hence sz, < supt,. 
Thus, by Lemma 1.5, 
fsdu < sup ftrdp, 
and finally, 
{fd =supfspdu < ftrdu = fgdp. O 


1.9 Examples. 


(i) Let €, be a point mass on a measurable space (Q, »’) for a 
fixed bE Q and let s=)o" ails, € W(Q, 27) be such that 
s(b) = a;,, for some ip € {1,..., m}. Then 


m 


fsdey = Dajey(A;) = aj, - 1 = 5(b). 
fal 
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Now let f € C;'(, 2’). Then there is a sequence {s,,} CW, such 
that f = sup {s,,}. Thus 


J fdey = sup{sn(b)} = f (0). 


Moreover, if p. = ce, (for some c > 0), using similar arguments we 
get [fd = cf(b). 
(it) Let 
b= H0CiEa,- 
By Problem 1.4, 
J fdp = VRocif (ai). 


Specifically, if c; = (")p'(1 — p)", then ju is the binomial measure 
Bn.p (See Example 1.8(éii), Chapter 5.] Furthermore, if f(x) = ec”, 
for 0 € R, then the transform of the binomial measure 


0 ++ ff(x)u(dx) = fe*B,,p(dz) 
= Vino cre" = (1+ pe” — p)" 


is a function in @ and is referred to as the moment generating 
function. In the general definition of a moment generating function, 6 
runs the complex plane C. 


(i277) Let (Q, 3’, 44) be the measure space with = (0, 1], 

5} = B({0,1]), and = A (Borel - Lebesgue measure on B([0, 1]). 

Let C’ be the Cantor set and G,,, be the open intervals of the Borel - 
1/2 


Lebesgue measure 5(3)” (introduced in Example 3.11, Chapter 5). 


Let us define the function 


ney={ rEC 


oy DE Gyy WH 1,2, 205 
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We evaluate the integral [ f(2)A\(dx) (with respect to the Borel - 
[0,1] 

Lebesgue measure). First of all, we have to identify the function f, 

which can be represented in the form f = sup{s,}, where 


1 ZEC 
Sula) = 0, rE [0, 1]\(Gy U... UG,UCG) 
aa SG. k= 1lu2.0 


Clearly, s, € %([0,1], BN [0,1]) and f(x) = sup{s,,(x)}. Thus 


f €C;*((0, 1], B((0, 1) 
and hence 


f f(@)A(dx) = sup f sn(x)d(dx) 


[0,1] [0,1] 


sup |1-A(C) +0-A([0, I\{Gi,---, Gu CY +O kA) 
k=1 


(iv) Let {ju} be a sequence of measures on a measurable space 
(Q, 3’). Then pp = D>, fun is a measure on X’; and foran A € Y, 
the integral of the indicator function 1, is 


Sls du = p(A) = ey en(A) = a S14 dita. 
Let s € W,(Q, 2’). Then 
Jsdp = dope HAR) 
p=1%k > n—tin(Ar) (1.9) 
= ret oka tkbn( Ag) = Daas sditn. 


Now, for f € Cz', we have f = sup{s,;} such that {s;}? CW. Let 


1. Integration on C~(Q, »’) 369 


nm 
bin, = Deinr J) 85 bi. 
In as muchas {b;, } is monotone increasing, 


sup sup{bj,} = sup sup{by,,}, 
which yields that 


sup iS sj dei = sup Dia Sf dui = DS fdus. (19a) 
Therefore, 
J fdy = sup f sjdu [by (1.9)] = sup D1 fides [(by (1.9a)] 


= ead fd. 


Thus we showed that 
Jfd ( 2) = id J aan , Ve Ce, 2). oO 


Now we further enlarge the class of integrable functions in light 
of Notation 6.6, Chapter 5. 


1.10 Definitions. 
(i) Let (Q, X, 4) be a measure space and let f € C-1(Q, 2’; R) 
[or C-1(Q, X';R)]. If at least one of the integrals, Jf aj ot 


| fd, is finite, we say that the integral of f with respect to meas- 
ure |t exists and denote this integral by 


Jfdu=Jfftdp—ff-dp. (1.10) 


We also denote 


AQ, ©, u;R) = {f € C10, Y;R): f fd exists}. (1.10a) 


370 CHAPTER 6. INTEGRATION IN ABSTRACT SPACES 


If either integral of the functions f* and of f~ is finite, we say that 
the function f is ~u-integrable and again denote the integral of f by 
formula (1.10). The subset of C~! of all y-integrable functions is 
denoted by L1(Q, »’, ju; IR), that is, 


D1(Q, ©, u;R) 


={f ec (0,2): [ftdu < coand ff-du < oo}.  (1.10b) 


The letter L stands for Lebesgue space (whose space-feature will be 
established later on) and L' means that this Lebesgue space is a 
subclass of the more general L? family of spaces explored in [Dsh2]. 


Note that 
flflde = fftdut ffodp, (1.10c) 


which is due to |f| = f*+ f~ and Proposition 1.8. In light of 
(1.10c), expression (1.10b) can be rewritten as 


L'(Q, Z, uw; R) 
= {f €C1(0,E): fi fldu < oo}. (1.10d) 


If a measurable space (22,4) is specified, the notation f € 
(Q, 37, uw; IR) or f € L'(Q, ©, ; R) is sometimes shortened to f € 
Au) or f € L(y). 

(it) If Q=R", X’ = B, and yu is the Borel - Lebesgue measure 
A and if the integral of the function f in (1.10) exists, it is called the 
Lebesgue integral of f. If f is \-integrable, we write f € L1(A). 

(iii) If Q=R, © =B, and w=ypr (a Borel-Lebesgue- 
Stieltjes measure induced by an extended distribution function F’), 
and if g € L(Q, 2’, wr; R), then the integral in (1.10) is called the 
Lebesgue - Stieltjes integral of g; and we write g € L'(p) if g is 
jup-integrable. 


1.11 Remark. Let C~!(Q, ©;iR) be the space of all extended 
real-valued random variables on a probability space (Q, »’, P). From 
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Example 4.5(2), Chapter 5, we recall that any measurable function 
X €C1(Q, X) on (Q, ©’, P) is arv.. If X € L'(Q, 7, P; R), then 
the numeric value { XdP is called the expectation of the r.v. X, in 
notation, EX or ELX]. From the definition, EX makes sense only if 
X is P-integrable, which is equivalent to [|X|dP < oo (often per- 
ceived as an ambiguous restriction in probability books). O 


1.12 Proposition. The integral is a linear, monotone, non- 
decreasing functional on the vector space L*(Q, 2’, 1). O 


(See Problem 1.8.) 


1.13 Proposition. For each function f € L1(Q, 3, ;iR), the 
following holds: 


lf fdul < f\flde. 


Proof. Obviously, |f| > f and |f| >— jf. Thus, by Proposition 
1.12, we have 


S\fldu = ffdu, f\fldu > f(-f)du = —ffdu 


and consequently, 
S\fldw > \ffdul. Oo 


1.14 Notation. Let f € C~!(Q, ©; R) and A € SY. Then, denote 


Jfdu= [fla de. 


Specifically, it follows that 
Sfdu = ffdp. Oo 
9) 


Recall Definitions and Remarks 5.7, Chapter 5, regarding proper- 
ties that hold almost everywhere and the quotient sets of functions 
induced by a measure. 
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1.15 Lemma. Let (©, 3’, 11) be a measure space and let f € Cy". 
Then f fdu = 0 ifand only if f € [0] , where [0)* = [0], Cz". 


ye lu 
Proof. Denote N = {f > 0} (which is an element of »’). 


(i) Let f € [0];. Then NEN, CY. Let s, = nly (€ Y,), 
p=, 2..2ss4 Theretore, 


fsndp = np(N) =0, for all n. 


Denote s = sup{s,}. Then, by Proposition 5.6 (iv), Chapter 5, 
s € C;' and 


fsdu = sup{ [s,du} = 0. 
Furthermore, f = s,, = 0 on N°. Although f is arbitrary on N and, 
in particular, not necessarily oo, we have that s,, | oo pointwise on 


N. Consequently, f < s on 92 which, by monotonicity (Proposition 
1.8), yields 


0< ffdu< fsdu=0 
and hence [ fd = 0. 
(ii) Now let [ fdu = 0. Denote 
NaH SF} C= 7 (S06) HH 12,025 


Obviously, N,, € +’ and N,, Tt N, where 


N=UM={f>0}e5. 


n=1 
By continuity from below of ju, 


lim LN) = HN). (1.15) 


nm — CO 


1. Integration on C~'(Q, »’) 373 


Clearly, nf > 1y,. Again, by monotonicity (Proposition 1.8), we 
have that 


O= ffdu> fiindu=+.u(N,), 


which leads to u(N,,) = 0, n = 1,2,... . From (1.15) it follows that 
uN) = Oand hence N € N,,. Therefore, f € [0]7. oO 


1.16 Proposition. Let (Q, 2’, .) be a measure space and let f,g 
€ C;' such that f = g (mod 1). Then 


Sfdu= fgdu. 
Proof. By Problem 5.2(iiz), Chapter 5, we have that N = 
{f #g} © X. Therefore, by the above assumption regarding f and 


g, N EN, and the functions fly and gly are elements of the 
quotient set [0]. By Lemma 1.15, it follows that 


{findp = fglndp = 0. 
On the other hand, 
Jfdu= f(fln + flne)du 
= ffindut+ ffinedu = fflwedp. 
Similarly, 
Jadu = fglnedp. 
The statement follows from flye = gly-, Vw € 22. Indeed, 
Jf=ff+ff=Jff and fo=Jo+fo=fo 
N Ne Ne N Ne Ne 


with f = gon N°. O 
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1.17 Proposition. Let (Q, 7, 44) be a measure space and let 1G 


€C1(0,5;R) such that |f|<g ae. Then g€ L(Q, ©, w;R) 
implies that f € L1(Q, ©’, ;R). 


Proof. Let g € L1(Q, »’, u;R). Then by Proposition 6.3(iiz), 
Chapter 5, we have that 


g’ = sup{g, |f|} €C*. 
Clearly, 


|f| < g' everywhere and g’ = g (mod yz) (show it), 


and by Problem 1.12, g’ € L'(Q, 2’, y;R). Then, by Problem 1.9, 
fe L(O,2, wR). Oo 


1.18 Proposition. Let f,gEC(Q,) and f or gé 
D1(Q, 7, w). Then 


wz 


{fdu= fgdp, foreach AEX, (1.18) 
A A 


implies that f = g (mod 1). O 
(See Problem 1.22.) 


Theorem 1.19 and Corollary 1.20 modify and, to some extent, 
refine Proposition 1.18. 


1.19 Theorem. /f 1 is o-finite, f,g € L(Q,™, ;R), and 


{fdu< fgdu, foreach Ae x, (1.19) 
A A 


then f < g p-a.e. on Q. 
Proof. 
a) Let ps be finite. Denote 


An ={f >9+2, |g| <n} 


n? 


={f >g++}n{Ig| <n}. 
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Then, since by our assumption, [ fdju < [ gdp for each A € Y, we 
A A 


have 
L: = fgdu> ffdu > fg++)du 
An A, An 
(1.19) 
= M: = fgdu+ 5u(An). 
Ay 


On the other hand, by Proposition 1.13, 


Sgdu 


n 


< figldje < fndp = np(An) < ov. 


n An 


Therefore, from (1.19a) and because f{ gd is finite, L > M, which 
An 
yields that j4(A,,) = 0, for each n. 


Thus, 


Lt ( UA) < ei t(An) = 0. 


n=1 


On the other hand, from 


UA 7 ( Uts sl ) : (U1 lg s ") 
{f > 9} “(os gla) is finite} 
we conclude that w{f > g: g is finite} = 0. Hence, 
Hf >g:g > —co} =0. 
Letting B, = {g = —o0, f > — n} we have 


B,, By 


and therefore, 
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or, equivalently, nu(B,,) > cop(B,). This holds true if and only if 
1u(B,) = 0 (as the consequence of the agreement that co -0 = 0). 
Thus, 


» (Us) =f >9,9=—cof} =0. 


n=1 


In summary, we proved that w{f > g} = 0 implies that f < g p-ae. 
on (2. 


b) Now, let ps be o-finite and let 44, = Ressqg, fu. Then 


J fdu= flo, fdu< flo,gdu 
AAO, A 2 


and hence f < g pi-a.e. on 2,,. The rest of this case is obvious. i 
The reader can easily conclude the following. 


1.20 Corollary. If 11 is o-finite, f,g € L(Q,™, u;R), and 


ffdu= fgdu, for eachAe ZX, (1.20) 
A A 


then f = g p-a.e. on Q. O 
(For a related discussion also see Problem 1.23.) 


We conclude this section with a proposition below that is often 
cited in the sequel and have the reader prove it in Problem 1.14. 


1.21 Proposition. Each function f € L*(Q, 2’, ;R) is finite p- 
a.e.on Q. O 


PROBLEMS 


1.1. Prove Proposition 1.3. 


(i) For each measurable set A € &, 
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JVs du = pA). 
(ii) The integral [ is a nonnegative linear functional, i.e., 


J (as + bt) du = afsdut bftdp, 


where s,t € VW, anda,b € R,. 


(iti) For any two nonnegative simple functions, s,t © V,, such 
that s < t, it holds that 


fsdp < [tdp (monotonicity). 


1.2 Prove Corollary 1.6. For {s,}f?, {tr}? CW, such that 
sup{s,} = sup{t,} it holds that 


sup{ [sndye} = sup{ [tady. 
[Hint: Use the fact that s; < sup{t,} and t, < sup{s,}.] 


1.3 Let (Q, 2’, 4) be a measure space and c > 0. Show that for 
each f € L'(O, X’, uw; R), 


[fd(cu) = cf fa. 


1.4 Let f €C71(Q, XY, u;R) such that pp = $7" yci€a, (where c; > 
0, i = 0,...,n). Show that [ fdy exists and equals 


Sfdp = oct (a). 


15 Let f €C1+(Q, X, u;R) and suppose that there are two non- 
negative integrable functions wu and v such that f=u-—v. 
Show that f € £1(Q, 2’, yu; IR) and that 


Sfdp= Judy — Jord. 


1.6 Show that £'(Q, X’, 1; R) is a vector space over R. 


378 
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1.8 


1.9 


1.10 


1.12 


1.13 


1.14 


1.15 
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Al 


Show that (£1(Q, 2’, pu; 


),R, +,-, X) isa Riesz space. 


Prove Proposition 1.12. The integral is a linear, monotone 


nondecreasing functional on the space L'(Q, ©’, 11; R). 


Let (Q, 3’, 4) be a measure space and let f € C7'(Q, 7; R). 
Show that f € L'(Q,X,;IR) if and only if there exists 
g € L1(Q, X, w;R) such that |f| < g. 


Show that {21(0,©,e,;R):a€ 0} = {11(Q, 2, €4; 
a € QQ}. 


wz 
hl 


Show that if f€C1(Q,X;R) then f € [0], yields that 
J fd = 0. Does the converse hold true? 


Let (Q, X’, 2) be a measure space and let f € L1(Q, 2’, yu; R), 
g€C'(Q,2;R) such that f —g€ [0],. Show that ge 
L'(Q, ©, u;R) and that [fd = fgdp. 


Generalize Proposition 1.16. Let (Q, 2’, 4) be a measure space 
and let f,g € C-1(Q, YR) such that f = g (mod 1) and f € 
(Q, 37, wR). Then 


Sfdu= fodu 


and g € L(Q, 1’, 4; R). 


Show that each function f € £1(Q, 2’, ;iR) is finite p-a.e. on 

Q. (Hint: Let A={|f| = oo}. Show that ap(A) < o, 

Va € R,, and then show that the inequality lim  ny(A) < 
no 

oo implies that j1(A) = 0.] 


Show that for f €C1(0, 2), f fdy =0 for each A € ZY if 
A 
and only if f € [0],,.Does this assertion generalize Lemma 


1.15? 
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1.16 


117 


1.18 


1.19 


1.20 


1.21 


1.22 


Show that 


L'(Q, ©, wa + po; R) = L'(Q, 2, pr; R) 9 £1 (Q, Y, po; R). 


Let 7 be a Poisson measure and let f € C;'(R, B;R). Show 
that 


ffm = Vo e OH Fn). 


Under the conditions of Problem 1.5 assume that 
a) f(z) ==", 
b) f(x) =I(e +1) 100)(z), where I(x) = fy°e*t? "dt 
is the gamma function, 


and find in each case the integral of f relative to measure 7y. 
Let {un} be a sequence of measures on a measurable space 
(Q, X’), {c,} be a sequence of positive real numbers, and let 


[= >>, Cnn [which is a measure on (Q, 3’)]. Show that for 
every f € L'(Q, Y, u;R), 


Jfdu = 5 nl 1 eles 


Let ys and v be two measures on (2, 2’) such that pp < v. Show 
that for each fe L'(0,2,y;R)NL(Q,2,v;R), the 
integral [fd(v —) makes sense, f € L'(Q, Y,v — y;R), 
and that 


[dv — ) = ffdv — f fap. 


Let yu and v be two measures on (2, 2’) such that pp < v. Show 
that for each f € C7'(O, ©; R), ffdu < ffdv. 


Prove Proposition 1.18. Show that if f,g € C-'(Q, ©;R) and 
forge L'(Q,2,u;R), then 
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{fd = fgdu foreachAe xX (P1.22) 
A A 


yields that f = g (mod 1). 


Show by a counterexample that dropping the condition f or g 


€ L1(Q, ©’, w;IR) in Problem 1.22 need not yield f = g (mod 
jt) even if f and g are nonnegative. 


Is L(Q, 3; yz, R) a linear space? 


Let g €C1(, Y,u;R, B(R)) such that fgdu>0 for all 
A 
A € 3’. Show that g > 0 p-a.e. 


In Problem 1.15, it was stated that f € [0], if and only if 
J fd =0. Corollary 1.20 stated that f fdy = fgdu if and 
A A A 


only if f =g modulo p, provided that at least one of the 
functions f or g € L'(O,X;v,iR). These two seem to be 
similar if in Problem 1.15 we replace f with f — g. Is it so? 


Is L'(Q,L;u,R,_) a proper subset of C;'? What is the 
relationship between L(Q, 17; 4, R,) and C,!? 


Let f €C1(Q, ©; R). Show that f € L'(0,5, u;R) if and 
only if for each « > 0, there is a function g € L'(Q,™, uw; R,) 
such that 


J |fldu se. 
{\fl29} 


Show by a counterexample that [1 is not an ¥-space. 
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NEW TERMS: 


Lebesgue, Henri, 361 

Fatou, Pierre, 361 

Fubini, Guido, 361 

Riesz, Frigyes, 361 

Nikodym, Otto, 361 

Radon, Johann, 361 

integral of a nonnegative simple function, 362 
Dirichlet function, 364 

integral of an extended nonnegative function, 365 
integral with respect to Dirac point mass, 366 
moment generating function, 367 

integral of an extended real-valued function, 369 
j-integrable function, 370 

L1(Q,&, 1; R), Lebesgue space, 370 

(Q, ©, 4; R)-space 370 

Lebesgue integral, 370 

Lebesgue - Stieltjes integral, 370 

expectation of a random variable, 371 

L' as a vector space, 371 

integral as a linear functional on L', 371 

L' asa vector lattice, 378 
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2. MAIN CONVERGENCE THEOREMS 


The following result is one of the basic convergence theorems a 
special case of which (Corollary 2.2) was originally proved by the 
Italian mathematician, Beppo Levi (1875 — 1961). 

2.1 Theorem (of Monotone Convergence). Let {f,,} 1 CC’. 
Then 


Jsup{fn}du = sup{f frdu}. (2.1) 


Proof. Let f = sup{f,,}. Then, by Proposition 5.6(iv), Chapter 
5, sup{fn} €Cz'. Thus, the integral on the left-hand side of (2.1) 
makes sense. On the other hand, for each element f/f, € Ce. there 
exists a monotone nondecreasing sequence of nonnegative simple 
functions {eo tt C W, such that sup{s\””: ke 1, SF 
Let 


ty = max{s\), Sees s*)}, 


Because W, is a lattice, it follows that t, @W,, k=1,2,... 


Furthermore, {t,} is monotone nondecreasing. Because {f,,} is 
monotone nondecreasing, we have 


SS frets So) SRS fee, Stes 
and hence 3 < fe, 7=1,...,k%, which leads to 
te < fr (2.1a) 


sup{t,} < sup{fi.} = f. (2.1b) 


and 


On the other hand, ty, > s(” for ky > n; this yields 


n 


sup{s”: ky =1,2,...} = fn < sup{t,: k= 1,2,...}, 
kn 
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and, consequently, 
sup{ fn} = f < sup{t,}. (2.1¢) 


Thus, by (2.1b) and (2.1c), 


f = sup{ty}. 


Now, the facts that f = sup{t,} and that {t,} is monotone non- 
decreasing imply that 


J fd = sup [tpdy. 
In as much as t; < fi), by (2.1a), we have by Proposition 1.8 


Jtrdp < ffedp, 
which yields 
J fd = supftpdu < supf fed. 
Finally, the inverse inequality holds due to f;, < f and Proposition 


1.8. O 
2.2 Corollary (Beppo Levy). Let {f;,} C Cz‘. Then 


J baa du = 3 [fd Oo 


n= n=1 
(See Problem 2.1.) 


The monotone convergence theorem can be generalized for an 
arbitrary monotone sequence under a minor constraint. 


2.3 Theorem (Generalized Monotone Convergence Theorem). 
Let {fn}? CCQ, ©;R) and g € C7(Q, 2; R) such that fr > g 
for all n and [gdp > —co. Then, 


sup{ J fndu} = fsup{ fn} dy. O 
(See Problem 2.2.) 
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The next is the very often-used Fatou's lemma. 


2.4 Lemma (Fatou). Let (Q, 5’, 4) be a measure space and let 
{fn} CCQ, &). Then 


flim frdu < lim f frdp. 


Proof. By Proposition 5.6 (v), Chapter 5, lim f,, € C;' and by 
Proposition 5.6 (iv), Chapter 5, 


On. = intl fj kon} e C.. 
Clearly, the sequence {g,,} is monotone nondecreasing and hence 
sup{g,} =limf, and g, < f;, forallk > n. 
By monotonicity of the integral, 
loud = J fda, Vk > % WH 1,2, 045 
which implies that 
Sond < inf{ f fd: k > nj. 
Finally, by the monotone convergence theorem, 
fsup{gn}dp = sup{ fgndp} 


< sup inf{ f fd} = lim f frdy. O 


2.5 Definition. Let f, {f,} C L'(Q, XY, u;R). The sequence 
{fn} is said to converge to f in mean if 


Jlf—-f|dp— 0. i 


We now formulate and prove one of the central results in the 
theory of integration. As with the monotone convergence theorem, 
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the following theorem enables us to interchange the limit and the 
integral for a pointwise convergent sequence of functions. However, 
it does not require that the sequence be monotone nondecreasing and 
nonnegative. On the other hand, the sequence needs to have an inte- 
grable dominating function, and thus it is not a generalization of the 
monotone convergence theorem. 


2.6 Theorem (Lebesgue's Dominated Convergence). Let 
(Q, ©’, 4) be a measure space and let {f,} CC~1(Q, ©;R) be a 
pointwise a.e. convergent sequence. Suppose that there is a [- 
integrable function g (€ L*(Q, 2’, 1;1R)) such that g > 0, and that 
lfn| <9, n =1,2,... . Then the following are true. 


(i) There exists at least one function f €C~', such that 
f <0, to which the sequence { f,,} converges a.e. in the topology 
of pointwise convergence. 


(ii) f € L'(Q, XY, w;R) and {fr} C L'(Q, ZY, w;R). 

(iti) The sequence { f,,} converges to f in mean, that is, 
lites] | f = ful da =O. 

(Go) ffdu = limp se | jadi 


Proof. 


(i) By our assumption, there is a negligible set II such that 
liar eee fr(w) 
exists for all w € II° and there is a p-null set N; > II. Therefore, 
Nj CII° and 
kha ore fr(w) 
exists for all w € N¢. Because g € L1(Q, »’, 1), by Proposition 1.21, 


it follows that g is finite pi-a.e. on (Q, that is, there is a pu-null set No 
such that g(w) < co forall w € Ns. Define the function 


F(a) = La(@) img J, (e), (2.6) 
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where A = (N; U No)°. Clearly, f, converges to f pointwise j-a.e. 
on 9 and hence, by Proposition 5.6(ziz7) and (vi), Chapter 5, f € 
C-!. Indeed, because f,, and 14 € C™!, it follows that f, 14 € C7! 
and that f,14 — f in the topology of pointwise convergence; the 
latter implies that f,, — jf pointwise p-a.e. on 22. 


(it) From (2.6) it follows that on set A, limp f, = f; in 
addition, {f,} is dominated by a finite function g on A. Thus, 
|f| < g on A and, due to (2.6), f = 0 on A°. Hence, 


|f| < 00 and |f| <g, Wwe. 


By Proposition 1.17 and in as much as |f| < oo, f € L1(Q, Y, p). 
Also by Proposition 1.17, {f,} C L1(Q, 7, 2). 


(iit) We prove that f,, is convergent in mean to f, that is, 
Let on = |f — fal (€ Cz(O, Z)). Then, 0< gn < [fl +g. 
Because 


Ifl+g€ L(Q, 2, p), 


it follows that g, € L1(Q, XY’, 1), by Problem 1.9. 


[Observe that because linearity of the integral holds just on L', 
we do need to show that g,, € L! which would lead to 


S\fl+o-9n = fUfl+9) + f° — Gn). 


Applying Fatou's lemma to the sequence {| He +9-9n}, we 
have: 


flim(|f] + 9 — gn)du < lim f(|f| +9 - 9n)due 
= f(\f| + g)du + lim [( — gn)du 


= ffl + 9)du— lim fondu (2.6a) 
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Because f, — f a.e., then g, > 0 ae., and hence |f| + g-— Gn 
— |f|+ 4g a.e. which implies that 


lim (|f] + 9—9n) =|fl|+g ae. 


By Proposition 1.16, 


Slim ([f] +9 — n)du = S(lf| + g)du, 


which, together with inequality (2.6a), yields 


S(fl+9du < f(s] + Qdu—tim fondu 


or, equivalently, 
lim fgndu < 0. (2.6b) 


Because g, > 0, (2.6b) reveals that 


lim fgndpu = 0 
and thus 


lim f\f — frldu = 0, 
which proves (222). 
Now (iv) is due to Problem 2.6. O 
2.7 Examples. 


(7) We evaluate lim fone — x)"X(dx). First observe that the 


sequence {na(1 — x)"} is convergent to the function 0 pointwise on 
[0, 1]. However, it is an easy exercise to show that the sequence 
{na(1 — x)"} does not converge to 0 uniformly. Otherwise, we 
could interchange the limit and the integral. (See Problem 3.12 of the 
next section.) Fortunately, the functions na(1 — x)” are uniformly 
bounded by 1. Therefore, function 1 can be taken as a pertinent 
integrable majorant function in the Lebesgue dominated convergence 
theorem. This enables us to interchange the limit and the integral and 
conclude that 
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lim, [}na(1 — 2)"\(dx) = 0 
n— oo JO . 
[We can verify this result by direct computation of the integral 


1 n _ n 


and then passing to the limit.] 


(ii) Calculate lim f,’(1 + £)"e~?* (da). Clearly, 


(1+ 2) "Inge < e* € 1. 
Hence, by the Lebesgue dominated convergence theorem, 


jim, f (1 + 2)" To n(ve7?* Adz) 


dv 


= flim. (1+ £)"Lom ae" A(dx) = fo°e* (dz) = 1. oO 

2.8 Remark. Note that we treated fonad —x)"X(dx) in 
Example 2.7(7) informally both as Lebesgue (L) and Riemann (R) 
integrals (in as much as they are identical in this case), although a 
formal relationship between the two needs to be rigorously devel- 
oped, which we explore in Section 3. The same applies to Example 
2.7(22). In Problems 2.9 - 2.11 we therefore assume that the Lebesgue 
integrals are equal to Riemann integrals. O 


The following are analogues of the main convergence theorems 
(monotone convergence theorem, Fatou's Lemma, and Lebesgue's 
dominated convergence theorem) for measures, that are often used in 
probability and control theory. The theorems are essentially based on 
the results of Onésimo Hernandez-Lerma and Jean Lasserre [Her], 
which were established in 2000 under weaker conditions than those 
in previous texts and papers. 


2.9 Lemma (Fatou). Let f € Cz'(Q, 2) and {1, p11, fl2,... } be 
a sequence of measures on 3) such that for each A € 3’, lim p,(A) 
> yA). Then 
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Sfdu < lim ffdpn. 


Proof. Let {s,}7 CWO, X) such that s, 7 f and 


— yr _(*) 
Sk = Da; La, 


Hence, foreach k= 1,2, 2.2, 


lim f fdpin > impo f skdpin = lim noo a pin(Ag,) 


OY litt sce tin Ay) > fords 


The statement now follows from the definition of integral. O 


2.10 Theorem (Lebesgue's Dominated Convergence for Mea- 
sures). Let f € C~'(Q, ’) and {v, 1, [u1, [l2,... } be a sequence of 
measures on 3) such that for each AEX, [,(A) > MWA), 
[in <v, and { fdv < ow. Then 


Proof. Because ju, <v, it is easy to verify that v — pi, is a 
measure on 2’. Due to Problem 1.25, 


Jfdw — un) = [fdv — f fdpn. 
Furthermore, 


lim(y — in)(A) = (A) — limpr,(A) 


= (A) — lim, pin(A) = V(A) — (A) = (Y — pA). 


n> co 


The last inequality holds true, because obviously js < v and hence 
vy — {4 is a measure on 2’. Now, all conditions of Fatou's Lemma 2.9 
are met for the sequences {v — j1,,} and {,1,,} and therefore, 


[fd — w) = ffdv — [fap < limf fav — pun) 
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= lim(f fdv — f fdpn) = ffdv —limf fdpn 


and 


Jfdu < lim f fdpn. 


Combining both inequalities we have 


limf fdpn < ffdu <limf fdpn 


and hence, the statement. LC 


To prove the Theorem of Monotone Convergence for measures 
we need the notion of setwise convergence. 


2.11 Definition. Let (2, »’) be a measurable space and {ju} be a 
sequence of measures on (2, 5’). We say that {j,1,,} converges to a 
set function jz setwise if lim ju,(A) = p(A) exists for each A € 2’. 


The set function ju is called the setwise limit of {t,}. O 


2.12 Proposition. The setwise limit \1 of {{1,} has the following 
properties. 


(2) sis monotone and additive. 


(it) Let {Aj, Ao,...} be a sequence of pairwise disjoint sets 
from / and A, C A € 3. Then 


yepeiHAr) < BCA). (2.12) 
Proof. 
(2) Is trivial. 


(27) It can be verified directly from the definition of the setwise 
limit by using monotonicity and additivity or just due to Proposition 
1.3(22), Chapter 5. O 


We are wondering what condition imposed on a sequence { 1, } 
makes its setwise limit a measure. For instance, if the sequence {ju, } 
is monotone nonincreasing, then the limiting set function js need not 
be a-additive, as we learn from Problem 2.12. The following theo- 
rem provides us with some sufficient conditions. 
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2.13 Theorem. Let a sequence {{1,} of measures on a measure 
space (Q, 3’) be convergent to a set function {1 setwise. Then ju is a 
measure if one of the following conditions holds. 


(i) {pn} is a monotone nondecreasing sequence. 
(27) wis finite. 


Proof. Let {A;,} be a sequence of pairwise disjoint measurable 
sets with A as its union. 


(i) {x} is monotone nondecreasing, for each m= 1,2,... . 
Thus 


[lm (A) _ Yopeittm(Ar) = pag), 


which, combined with (2.12), yields the statement. 


(22) Because ys is finite, by Theorem 1.7(i2), Chapter 5, if ju is 
not o-finite (which we assume), it would not be @-continuous. In 
other words, there is a monotone nondecreasing sequence {A;}| @ 
of measurable sets such that limp.f(Ap) = €>0. Let 
a, = b; = 1 and suppose a, and b, are positive integers defined for 
all 7 < n. Furthermore, let a,41 > a, such that 


Le < HMan4s1(Ab,,)- 
[If there is no such a,.,, then it would surely contradict our 


assumption that lim; ...p(A;) = € > 0.] Now, let 6,4; > 6, such 
that 


1 
BE = Pani‘ Abai): 
(Such a 6,4; should exist, because ji,,,, 18 @-continuous.) For 


By: = Ay,\Av,,,. we have that j1,,,(Bn) > “2. Therefore for j 
being odd and 7 > k > 1, 


ba; ~ By 2 de, 
n even:n>k 
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Then, for k > 1, 


H( >, a.) > Se. 
n even:n>k 


We can easily verify that the last inequality holds true also for all odd 
values of n. Consequently, for all k > 1, 


wAy,) = n(O21Bs) > Fe. 


The latter contradicts the assumption that lim;_...~uW(A,) = ¢ > 0. O 


The next theorem is of monotone convergence for a sequence of 
measures. 

2.14 Theorem (Monotone Convergence). Let f € C,'(, ) 
and {{11,[2,-...} be a monotone nondecreasing sequence of 


measures on a measure space (QQ, 3”). Then there is a measure {4 on 
(Q, 3’, w) such that [,(A) — (A) for all A of ©? and 


im f fdpn = [fdp: (2.14) 


Proof. Because {1} is monotone nondecreasing, by Theorem 
2.13 (2), the setwise limit ju of {,u,} exists and is a measure on 
(Q, X’). In as much as f is nonnegative and j1,, 7 ju, the sequence 
{ { fdj } is monotone nondecreasing and hence 


lim f fdpm = limysoof fdtin < ffdp. (2.14a) 
The last inequality holds because of [ fdjin < { fd, which, in turn, 


is due to Problem 1.26. On the other hand, from Fatou's Lemma 2.9 
applied to our case, 


J fdu = littyos5f J dite 


which, combined with (2.14a), yields the statement. O 
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The convergence theorems below are for sequences of nonnega- 
tive functions and measures at once. 


2.15 Lemma (Fatou for Measures and Nonnegative Func- 


tions). Let {1 {11, [2,... } be a sequence of measures on a measure 
space (Q, 2’) and let { f,,} € Cz'(Q, SX’) such that for each A € &, 
lim p,(A) >= (A). Then 


Jfdy < lim f frdun, (2.15) 
where 
fw): = limf,(w), w EQ. (2.15a) 


Proof. First assume that {f,} C Cz!(Q, 2’). Then, for every 
fixed positive integer N and for every n, 


f fadin, > fat ine SN }djta: (2.15b) 


Applying the version of Fatou's Lemma 2.9 to the right-hand side of 
(2.15b) we have 


lim f frdjm > lim finf{ fm: m > N}dpm 
> finf{fn:m > N}du. (215¢) 


Because {inf{f,,: m > N}w} 7 f defined in (2.15a), applying the 
standard monotone convergence Theorem 2.1, we arrive at 


lim { fn@jte = Pan fins m > Nhdu = teas O 


The following generalization of Fatou's Lemma 2.15 is applied to 
arbitrary measurable functions { f;,} and its proof is left to the reader. 
(Problem 2.13.) 


2.16 Lemma (Fatou for Measures and Functions). Jn the 
condition of Fatou's Lemma 2.15, let {g, f1, fo,...} CC 1(Q, ») 
such that for all n, fn > g and 


limp oo fgdtin = fgdu > —oo. 
Then, 
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Jfdp < lim ffrdjin, (2.16) 


fw): = limf,(w), w € Q. O 


This is a further modification of the Lebesgue dominated conver- 
gence theorem for a sequence of measures and functions. 


where 


2.17 Theorem (Lebesgue's Dominated Convergence for 
Measures and Functions). Let {f,} C C71(Q, 2), g € Cy'(Q, Z), 
and {v, [M, 1, [2,...} be a sequence of measures on the measure 
space (Q, 3) such that: 


(i) fn Sv. 
(277) f, converges to a function f in the topology of pointwise 
convergence. 


(722) [ln converges to [4 setwise. 
(iv) Jgdv <o. 
(v) |frl <9. 


Then, 
litt ses] Jeli = [pap (2.17) 


Proof. Consider Theorem 2.10 for which we use the conditions 
(2), (422), and (iv). Then, applying Theorem 2.10 to g we have that 


limp oo f gd in — fod. 


Now, because g+ f,, > 0 for all n, we have from Fatou's Lemma 
2.16, 


lim f(g + fr)din > flim(g + fr)dp. 
On the other hand, because [gdju < [gdv < oo, 
lim f fd = ffdu = lim f frdpn 


which yields the assertion. O 
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PROBLEMS 
2.1 Prove Corollary 2.2. 
2.2 Generalize the monotone convergence theorem: Let {f,,}T 


2.3 


2.4 


2.5 


2.6 


2.7 


¢ C1, Z) and g € C-1(Q, X) such that f, > g for all n 
and suppose that [ g dj. > —oo. Prove that 


sup{ J fndu} = fsup{ fn} dy. 


Show that if ['gdju = —oo, the Generalized monotone conver- 
gence theorem need not hold. 


Let {fn} ©C' and g € C™ such that f;, < g for all n. If fg 
djs < oo, show that 


inf{ [ f,du} = finf{f,} dy. 


Let (Q, 3’, 41) be a measure space and let {A,,} C 2’. Prove 
that 


(lim An) < lim p(An), 
and if 44 < oo that 

(lim A,,) > Tim u(A,). 
Show that if f,, — jf in mean then 


Jitis rem J frde = J fdu 


Generalize Fatou's Lemma 2.4 in the following way. Let 
{fn} CC71Q,2) and g € C-1(Q,) such that g < f, for all 
n. Let [g~ dps < co. Show that 


flim fy dye < tim f fn dpe 
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2.8 


2.9 


2.10 


2.11 
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Let {fn} C C1(0,) and g € C71(Q,’) such that f, < g for 
all n. Let [g*du < oo. Show that 


lim ffpdu < flim fh dp. 


Let 


n, O<a< 7 
fr(X) = i 


0, t <2 <0. 


Show that f, —0O A-ae. in the topology of pointwise 
convergence. Explain why 


flim sco fn Mdx) F lim yo f fn M2). 


Let 
ne, Cae 
2 2 1 2 
jO=< =—wwes 2), {Hes ] 
2 
0, x > nt 
Show that 


flim fy dd < lim f fn dd. 


Explain why the Lebesgue dominated convergence theorem 
cannot be applied. 


Use Lebesgue's dominated convergence Theorem 2.6 to prove 
that for all a > 0, 


lin "= = 1 (a), (P2.11) 


n= 00 a(a—1)---(a+n—-1) 


where I'(a) is known to be the gamma function and is 
expressed as the improper Riemann integral 


[oe =, 62" ae (P2.11a) 
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2.12 Give an example of a monotone nonincreasing sequence of 
measures convergent to a set function js setwise such that pu is 
not a measure. 


2.13 Prove Fatou's Lemma 2.16. 


2.14 Let f € L(Q, 2, u;R,B(R)) and suppose that {Q,,} is a 
countable measurable partition of 2. Show that 


Jfdu a Dina J fe. 
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3. LEBESGUE AND RIEMANN INTEGRALS ON R 


In this section we develop integration techniques in L'(R, B, A;R) 
(see Definition 1.10(zi)). The key idea is to reduce the Lebesgue 
integral to the Riemann integral whenever it is possible in 
combination with the main convergence theorems. The Bernhard 
Riemann notion of an integral, which was a refinement from its 
inception by Augustin- Louis Cauchy in 1823, was introduced in 
1854. 


We begin with the concept of the Riemann integral of a bounded 
function on a compact interval suggested by the Frenchman Gaston 
(in some sources, Jean - Gaston) Darboux (1842 — 1917) in 1875. Al- 
though the construction below is self-contained, the reader may find 
it plausible to go over Section 9 (Example 9.9(v7)), Chapter 3, of 
[Dsh1] for topological preliminaries of this construction. 


Let 2 = [a,b] be a compact interval in R. By Definition 1.7(i2), 
Chapter 1, the partition of [a, b] is any ordered n-tuple P = P(n) = 
PGi; co<5 Gg) With 


PH{OGj 6229 Gn= [a,b]: a=a<a<-::: <a, = 5}, 


Let P; and P, be two partitions of [a,b]. We say P» is finer than P, 
if P, C Py. P, is also said to be a refinement of P, (in notation 
P, x P2). Thus, if P is the set of all partitions on [a,b], x is a 
partial order on P. 


Denote by C;'([a, 6], B([a,b])) = C;*([a, b], BN [a,b];R) the 
set of all real-valued, Borel-measurable, bounded functions on [a, }]. 
Let f be a function from C, '([a, b], B([a, 6])) and let P(n) = {ao, 
..+, Gn} be a partition of [a, b]. Introduce the following notation: 


Ag= G40). = 22.3, Ay = lagi |, 


m,; = inf{ f(x): « € Aj}, M; = sup{f(a): x € Aj}, 


L(f,P) = Yo_,mi(a; — a;-1) (the lower Darboux sum), 
U(f,P) = >>, Mi(a; — a;-1) (the upper Darboux sum), 
Gp = Dela, u(f,P) — Mila, 
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Now, a function f : [a,b] — R is called a step function, if there is a 
partition P of interval [a,b] such that f is constant on each subinter- 
val (a;-1,@;),i=1,...,n. Recall that, unlike a step function, a 
simple function is constant on each Borel subset (not just subinter- 
val, of [a, b]) in a finite partition of [a, b]. 

Denote by S(Ja, b]) = S({a, b], B([a, b]), IR) the space of all step 
functions on [a,b]. Clearly, step functions are Borel-measurable 


functions and / and u are elements of S({a,6]). Thus, L(f,P) and 
U(f, P) can be expressed in terms of Lebesgue integrals as 


L(f,P) = JUf,P)daA = Jif, P)dr 


[a,b] 
and 


U(f,P) = Ju(f,P)drA = fulf,P)dr 


[a,b] 
(in agreement with Notations 1.13). 

Now let {P(n) = P(ao,---,@n); 2 = 1,2,...} be a sequence of 
partitions of [a,b] such that {P(n), x } is a chain. Denote by 
|P(n)| the Lebesgue measure of the largest subinterval of P(n) and 
call it the mesh of this partition. A chain {P(n), = } is said to be 
canonic, if {|P(n)|} is a monotone nonincreasing sequence, vanish- 
ing whenn — oo. 


Let |, =I(f, P(n)) and u, = u(f, P(n)) denote the lower and 
the upper step functions corresponding to partition P(n) in a canonic 
chain. Then it can be easily verified that 


io Ue er Sf Sc le 2 hh. (3.0) 


Let G.=U0,P@))=—fiaa and Ig=— Lf, Pmjy= 
{l,dX. By monotonicity of the Lebesgue integral, we have 


Uh Sige SoS Te SL. 


Because f is bounded, there exist U_ = inf U,, = lim U,, (called the 
upper Darboux integral) and L, = sup L, = lim L,, (called the 
lower Darboux integral). 
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3.1 Definition. If U =L,, then their common value, 
R(f, [a, 6]), is called the Riemann integral of the (bounded) function 
f over [a,b], and the function f is called Riemann integrable. 
R(f, [a, b]) is also denoted by the symbol 


R) f f(x)dx 


[a,b] 


Sometimes, to tell a Lebesgue integral from a Riemann integral we 
write 
L) [ fle)de = f f(w)A(ae 
[a,b] [a,b] 

For notational consistency, most often we use the d\ symbol within 
the Lebesgue integral (rather than an “L” in front of it). However, 
many textbooks and papers routinely use the same symbol dz in 
Lebesgue integrals as in Riemann integrals, which we do not believe 
should cause any serious confusion. O 


3.2 Theorem. Let f € C,'([a,b], B([a,b])). If f is Riemann 
integrable on [a,b], then f € L*({a, b], B([a, b]), A;R), that is, it is 
Lebesgue integrable on |a, b|. In this case, the Riemann integral of f 
equals the Lebesgue integral of f. 

Proof. Let f be Riemann integrable. Then, lim. (U;, — En) 


0. Applying Fatou's lemma 2.4 to the sequence {u, —1,}C 
C;"([a, b], B({a, b])), we have 


0 < flim (tu, —In)ddA = flim (Un — In) dA 
< lim f(t — In) dd = lim(Uy — Ly) = 0. (3.2) 


Because of (3.0), (3.2), and the fact that uw, — f and f —1, are 
elements of C;', by Lemma 1.15 we have that 


u=limu, =1=liml, =f ae. (3.2a) 


Also, since f € C;'([a,b], B([a, b])) and l,, < f, it follows that 
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In € L*({a, ], B((a, b)), A;R). 


Now we can apply Lebesgue's dominated convergence theorem 2.6 
to the sequence {1,,} with respect to its a.e.-limit-function f to have 


mm CC 


fim, flndd = jim, Ln =(R) ff(w)de = f far. 
[a,b] [a,b] [aD] 


O 
3.3 Remarks. 


(2) The functions u = inf wu, and / = sup J, in (3.0) are called 
the upper and the lower Baire functions. Therefore, L+ is the 
Lebesgue integral of the lower Baire function /. 


(22) The above construction of the Riemann integral, which is 
now common in mathematical analysis courses, belongs to Gaston 
Darboux in his work Mémoire sur la théorie des fonctions discon- 
tinues of 1875. The original construction of the Riemann integral, 
that goes back to Augustin-Louis Cauchy (1789 — 1857) in 1823 (and 
later generalized by Riemann in his Habilitationschrift of 1854), is 
as follows. Given a function f € C;'([a, b], B([a, b])) and a partition 
P from a canonic sequence of partitions {P(n) = P(ao,..., an); 
n = 1,2,...} of the interval [a, b], define the Cauchy sum as 


Cf, P) = Via fda — ai-1), (3.3) 
where €; (called a tag) is any point of [a;_,, a;]. Note that, unlike the 
Darboux sum, the Cauchy sum is not specified because &;'s are 
arbitrary. If the limit 


C= lim) p(nyjoC SF, P(n)) (3.3a) 


exists aS a unique number, then f is called Cauchy integrable on 
[a, b] and the value of this limit is denoted by (C) i. f(x)dza. Clearly, 


ea aia 
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and therefore the Cauchy integral exists if LD, —=U_. In 1875, 
Darboux proved that this is also a necessary condition for the exis- 
tence of C and in this case, C =R. Darboux's theorem and his 
approach are subjects to standard texts in mathematical analysis, 
whereas Riemann's concept of the integral is more common in 
calculus classes as it leads to a quicker and more lucid interpretation. 
As a sufficient condition of the existence of the Riemann integral, 
Cauchy required that f be continuous on [a,b]. Riemann relaxed 
Cauchy's integrability condition by requiring that for each « > 0, 
there is a partition P of [a,b] such that U_(f, P) — Li(f, P) <e. 
However, Riemann did not specify the class of functions, which are 
integrable (although he pointed out that a function can be dis- 
continuous on a dense set and nevertheless integrable). Lebesgue did 
this in his widely referred to Theorem 3.5 which is to follow. O 


3.4 Example. Let f be the Dirichlet jump function introduced in 
Example 1.4. Consider its modification 


f(z) = Lonpo,1)(2) € Cz"((0, 1], B((0, 1))). 


The Lebesgue integral of f exists and equals zero. The Riemann 
integral of f, however, does not exist, in as much as for every 
partition, the lower Baire function equals 0 (J = 0) and the upper 
Baire function equals 1 (w= 1). Therefore, the lower Darboux 
integral D4 = 0, and the upper Darboux integral U_ = 1. O 


3.5 Theorem (H. Lebesgue). Let f € C,'([a, 6], B({a, b])). 
Then f is Riemann integrable on |a, b| if and only if f is continuous 
A-a.e. on [a, b]. 


Proof. 


(i) If f is continuous on [a, bj, then it is uniformly continuous on 
[a,b]. This implies that for each ¢ > 0, there is a 6 > 0, such that 
for each partition P whose mesh is less than 6, 


(Show it; see Problem 3.1.) This leads to Riemann integrability. 
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(ii) Let f be bounded, Borel-measurable, and -a.e. continuous 
on [a,b]. If f is not continuous everywhere, but is bounded, it can 
have only discontinuities of finite magnitude. From the nature of the 
lower and the upper Baire functions, / and wu, it follows that / and wu 
coincide with f at all points of continuity of f. (A rigorous proof of 
this statement, known as Baire's theorem, is contained in many 
standard analysis textbooks.) At the points of discontinuity of f, 1 
assumes the smallest values and wu takes the largest values (this can 
be shown by elementary methods; see Figure 3.1.) 


u(x) =1(x), x # Xp 


I(x) =u(x),x FX, Gu(x%) 


I(Xy) 


Xy 


@ 
Figure 3.1 


Then, if f is discontinuous on a negligible set S, it should follow 
that w and / differ on the same set S.. By the above condition, S C N 
where WN is a measurable null set. Because f is bounded, u,, and 1, 
are measurable, bounded jump functions, and U,, and L,, exist. By 
the Lebesgue dominated convergence theorem, U_ — Li = 0, which 
implies that f is Riemann integrable. Indeed, by Lemma 1.15 
(because |, < Un), 


U_—L, = lim U, — Ln = Jim, (un — In) dA 
f-Ddd =0, 


because u—1=00n N°, Le., u = 1 A-a.e. 
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(iit) Let f be Riemann integrable. Then, by Theorem 3.2 
equation (3.2a) holds: 


fF =i = limygsal, = 0 = Mga Wa 2. 


Furthermore, f is bounded. We utilize again the above arguments. 
From the nature of wu and J, it follows that, in this case, u,/, and f 
coincide wherever f is continuous. At all points of discontinuity, 
whereas f assumes one of these values, the smallest values of them 
are assigned to / and the largest ones to u. Therefore, the set, on 
which the function f is discontinuous, equals the set on which u and 
| differ. This proves that f is continuous 4-a.e. O 


3.6 Remarks. 


(i) By employing a canonic chain of partitions on the X-axis, in 
construction of the Riemann integral we sometimes face the problem 
that the sequence of the corresponding lower jump functions {I,,} 
converges to the lower Baire function /, but it does not converge to 
f, as is the case for the Dirichlet function. Consequently, the lower 
Darboux integral gives a “wrong” value. In contrast, the construction 
of the Lebesgue integral literally sets up partitions on the Y-axis 
whose canonic chains form monotone increasing sequences of 
“lower” jump functions. The latter, due to Theorem 5.5, Chapter 5, 
always converge to f. Consequently, the lower Darboux integral L, 
equals the Lebesgue integral [ fd. 


(zi) Although Riemann and Darboux enlarged the previously 
existing class of integrable functions, the Riemann integral has many 
limitations, one of which goes back to the fundamental theorem of 
calculus in the form 


(R)f. f'(a)dx = f(b) — f@. 


This formula becomes meaningless when a differentiable function f 
is not integrable. On the other hand, the classical proof of the formu- 
la 
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1 x 
dada f(u)du = f(x) 


was originally based on the continuity assumption for /. The new 
concept of integration suggested by Henri Lebesgue in 1902 in his 
doctoral work restored the generality of the fundamental theorem to 
its current status. Furthermore, the class of Lebesgue integrable func- 
tions is significantly enlarged. 


Notice that from Theorem 6.5, Chapter 5, it follows that, unlike 
the Cauchy-Riemann-Darboux formation of partitions of [a,b] 
essentially leading to Definition 3.1 or (3.3a), in the Lebesgue integ- 
ral of f, partitions on [a,b] are inverse images under f* of partitions 
of interval [0,supf] on the Y-axis. This approach (based on an 
explicit approximation of f by simple functions) provides a better 
detection and incorporation of fine variations of f (which Riemann 
integral does not) thereby allowing the Lebesgue integral to handle a 
much wider class of functions on [a,b]. Besides, defining the 
Lebesgue integral through the integral for simple functions enables 
far more powerful and easy-to-accomplish operations manifested by 
convergence theorems under more relaxed conditions compared to 
the Riemann integral. 


On the other hand, this deficiency of the Riemann integral was 
later on corrected by the British mathematician Ralph Henstock 
(1923 — 2007) in 1955 and, independently, the Czech Jaroslav 
Kurzweil (born 1926) in 1957 by refining ideas of the French Arnaud 
Denjoy in 1912 and German Oscar Perron in 1914, and leading to the 
gauge or Henstock-Kurzweil integral (cf. [Bart], [Gor], [Hen], 
[LeeP], [LeeT], and [Swa]). 


In short, the gauge integral G of f on [a,b] is defined as follows. 
Given an ¢ > 0, if there is a function [{a, b], (0,00), 6] (called a 
gauge function) such that, for any partition a = ag < aj < ++: < Gy 
= b and any choice of tags €; € [a;_1, a;] satisfying aj;_; — 6(a;_1) 
<4 = GS a Xap 0G) 0 = Lyssa gt, 


IG = af (&:) (ai — a1)| < € 
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holds, the gauge integral G is said to exist. In the definition of the 
Riemann integral, the gauge function 6 is constant, which is the core 
defect that makes the Riemann integration less flexible. There are 
more gauge-integrable functions on [a,b] than Lebesgue- and “im- 
proper” Riemann-integrable functions combined. Some advocated 
even that both Riemann and Lebesgue integrals be replaced with the 
gauge integral in calculus and analysis courses. However, it did not 
bring in a broad support for many good reasons (which are beyond 
the scope of this book). Most importantly, the Lebesgue integral in 
the advanced literature on real analysis and measure theory, is often 
used as a mere illustration to the mighty abstract integral that do- 
minates stochastic and functional analyses. 


(iii) As we mentioned, in order that a function be Lebesgue 
integrable, it need not be bounded. A class of Riemann-integrable 
functions, as known, can be “extended” for unbounded functions by 
the use of the “improper integral.” Another need for the improper 
integral arises when the interval of integration is unbounded. In the 
latter case, the integral is constructed as usual on a compact interval 
(a, b], and then its values are taken for a + —oo or b > ov. This is a 
trick rather than a conventional integral construction. (See the further 
implication of this method below.) That is why such integrals are 
called improper. 


(tv) Unlike improper integration over infinite intervals, there is 
another way to integrate functions with the conventional approach of 
constructing an integral via uniform “partitions” of an infinite inter- 
val. Consider for example, a bounded Borel measurable function f 
on interval [a, co), and partition this interval by the sequence {a,, }, 
where a, =a+6n, n=0,1,..., for some positive 6. Then on 
each of the intervals A, = [ay, @n41) consider 


my, = inf {f(v7):a2e A,} and M, =sup{f(c): a € Ap}. 


Because the Lebesgue measure of each interval A,, equals 6, we 
have again the lower Darboux sum, 


L(f, 6) = Oy n>0 Mn » 
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and the upper Darboux sum, 
U(f, 6) = Oy n>0 Mn . 


If ie RG ee ne U(f,6) then its common value is denoted by 


Dif. fia\dz 


and called the direct Riemann integral. The function f is then said to 
be directly Riemann integrable. The direct integrability is used in 
probability, specifically in renewal theory, where such a notion is 
introduced for a class of nonnegative functions bounded over finite 
intervals. We not that the class of directly integrable functions is 
essentially more narrow than the class of Riemann integrable 
functions in the improper sense. O 


3.7 Examples. 


(i) Let Q = [0,1] and let f(x) = x71,4(x) + sing 14-(x), where 
A® is the Cantor ternary set. The function f is a bounded Borel- 
measurable function on [0,1] and obviously A-a.e. continuous on 
[0,1]. Thus, f is Lebesgue as well as Riemann integrable and 
f(x) =x? »-a.e. on [0, 1]. Furthermore, 


R) J f(a)dx = (L) f f(@)A(de 


[0,1] [0,1] 
L) [#*A(dz) =(R) [a*de = 
[0,1] [0,1] 
ae a Q = [1,2] and f(x) = (2 — 1)~/3. We wish to evaluate 
J {@) ). Because f is no longer bounded (on [1, 2]) we cannot 


"iy the same techniques as discussed above. Consequently, we 
introduce an auxiliary sequence of functions, { f,,}, defined as 


ee n, l<a<1l+3 
4 ie 
" (cx-1)-8, 14+ 4<a<2 
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(see Figure 3.2). It is easily seen that { f,,} is a monotone increasing 
sequence of continuous functions contained in C;'({1,2], B({1, 2])) 


with sup{f,} = f. 


Figure 3.2 


By Proposition 5.6 (iv), Chapter 5, f € C;'. By the monotone con- 
vergence theorem, 


J f(@)A(dx) = sup f fr(x)A(d2). 


[1,2] [1,2] 
On the other hand, 


| jndA = (BR) if fatale 


[1,2] [1,2] 


=(R)f ndx+(R)f (w-1) 8dr =3-3.- 4. 
[1,1+1/n?] [1+1/n3,2] 


Thus, 
sup f fn(z)A(dx) = 3. 


[1,2] 


Observe that the Riemann improper integration technique for un- 
bounded functions could also be applied to this function. O 
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3.8 Remark. As mentioned, the Lebesgue integrable functions 
form a much wider class compared to the Riemann integrable 
functions. Also recall that an L(IR, B, )-function f can be integrat- 
ed over arbitrary Borel sets, whereas the Riemann integral is defined 
just on intervals. With all these benefits, however, the Lebesgue 
integral does not have the same elegance and analytical tractability 
the Riemann integral has through to its “Newton - Leibnitz bridge” to 
derivatives and antiderivatives and a huge reserve of integration 
techniques. In many cases, whenever possible, the Lebesgue integral 
is just reduced to a Riemann integral. In addition, the class of 
Riemann integrable functions is enlarged through the functions that 
are Riemann integrable in an improper sense. This class includes 
functions with discontinuities of an infinite magnitude and functions 
defined on intervals of type [a, co) or (—0o, b] or (— 00, 00). 


In Example 3.7(i7) we examined a Lebesgue integral of an 
unbounded function. In a certain sense, the approach used there is 
reminiscent of Riemann integration of unbounded functions. In the 
proposition below we state that in most cases, when the integration 
over an infinite interval is needed, we can use Riemann integration in 
the improper sense and equate their values to those for Lebesgue 
integrals. This fact makes the Riemann improper integral fully legiti- 
mate. 


3.9 Proposition. Let f € rea R,B;R,) let f be Riemann in- 


tegrable on any compact interval. Then f € L'(R,B,;R+) if and 
only if the improper Riemann integral of f , 


R= lima ——oc J f(a)dz, 


7% [a,b] 


exists. [We say that f € R(IR), where R(IR) is the class of all 
functions on RR Riemann integrable in the improper sense.| In this 
caseR = [ fdX. 


Proof. Denote 


Raz =(R) ff(x)de where Br, = [-&, nl. 
B 


nk 
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Then, because f is Riemann integrable, R,,;, = { f1z,,dA. Observ- 
ing that 


fScupl fle, (9 2 jas ha 1250}, 
we have, by the monotone convergence theorem, 
{fd = sup Ry, = R < 0. oO 


3.10 Remark. The special case treated in the above proposition 
and applied to nonnegative Borel measurable functions can easily be 
extended to arbitrary functions of C~! by noticing that f € 
D1(R, B, 4;R) if and only if |f| € L'(R, B, A;R). Therefore, using 
Proposition 3.9, we conclude that |f| must be an element of R(R). 
In this case, evidently, 


C50 nea gO ee 
= (R) fo, f* (a) da — (R) fr, f(a) da 
= [ftddr—ffod\= ffaa. Oo 


3.11 Examples. 


(i) Consider the function f(x) = aoe (where k #0). We 
show that this function is Riemann integrable in the improper sense 
but not Lebesgue integrable over R;. We apply the Dirichlet criter- 


10n: 


Let g and h be two real-valued functions defined on 
[a, 00). If g is monotonically vanishing at co and | (R) 
f h(a)dx| <C, for each b>a and positive real 
number C (i.e., the integral of h is uniformly bounded 
in b), then the improper integral (R) {gh is conver- 
gent. 
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In our case, the function gF>z can be taken for g and sin x can re- 
present h. Then, the conditions in Dirichlet's criterion are met for 
a = 0, and consequently, (R)f;°f converges. On the other hand, 
f € L'(R, B,A;R) if and only if |f| € L1(R,B,A;R), which, by 
Proposition 3.9, is equivalent to the convergence of the integral (R) 
Jo. \f|: =. It is shown that I = oo. Indeed, 


Fe So (n+1)7 |sina|x re 


nt k2 +a? 


-_ =v T = ae aS anf, sint dt 
n=0 JO k?+(na+t k?+7?(n+1)? 


(the second summation is due to the inequality 7m +t < (n+ 1), 
for t € [0, 7]). 


Thus 


oo 27n = 
ie ae k?+n?(n4+1)? oh 


(it) The function f(x) = sinxexp( — =) is an element of C7! 
and it is Lebesgue integrable, because |f(x)| < g(x) = exp(—*) 
and g(x) > 0 and because 


Ta Jolejde=1. 


Observe that x > Ta exp(— 5), x €R, is the normal density 


function of the standard normal distribution. [See Example 5.10 
(ii2).| O 
PROBLEMS 


3.1 Prove (3.5) in Theorem 3.5. 


3.2 In Example 3.4, we showed that the Dirichlet function f on 
[0,1] is Lebesgue integrable, but not Riemann integrable. 
Because the rationals have the Lebesgue measure of 0, the 
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3.3 


3.4 


i F 


3.6 


a7 


3.8 


3.9 


function f is equal to 0 (a constant) for A-almost all points on 
[0,1], and therefore, it is continuous almost everywhere on 
(0, 1]. By Theorem 3.5, f must be Riemann integrable. This is 
just the opposite of the result of Example 3.4. What is wrong 
with this reasoning? 


Is the function f(x) = + on [0,1] Borel-measurable and )- 
integrable? Is f € L([0, 1], B([0, 1], A))? 


Show that the function f, such that f(a) = 4+ cos(+) on (0, 1] 


and f(0) =0, is Borel-measurable and not -integrable on 
[0, 1]. 


Let f: [0,1] — R be defined as 


f(x) = (—1)"*!n, ae A,=(s5, +), n=1,2,... 
0, x0. 


Show that f is improperly Riemann integrable but not 
Lebesgue integrable. 


Let f be a monotone increasing differentiable function on 
[a,b] and let w be its inverse function on [f(a), f(b)]. Prove 
that 


fr f@Xdx) = fin ye" @)ACdy). 


Investigate if the function f(«) = siz 1;,40}(@) (where 


ge 
0 <a<1) is improperly Riemann and Lebesgue integrable 
over IR. 


Let G be a nonempty open subset of [a, b] and let f be a Borel 
measurable function on [a,b], discontinuous at each point of 
G. Can f be Riemann integrable? 


Show that the functional 


414 


3.10 


3.11 


3.12 


3.13 


3.14 
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fll =f lflaa 


defines a seminorm on L'([a, b], B([a, b]), X). How can || - || 1 
be “made” a norm? 


Let s € W,({a, b], B({a, b])). Show that for each « > 0, there is 
a continuous function h € C([a, b]) such that 


|s—All pn <e. 


Show that the space C([a,b]) of all continuous functions on 
interval [a, b] is dense in (L1([a, b], B([a, b]), A), || - |lz1)- 


Use Lebesgue's Theorem 3.5 to show that the limit of a uni- 
formly convergent sequence { f,, } of bounded Riemann integr- 
able functions on [a, b] is Riemann integrable on [a, b]. Prove 
that under this condition, 


lim (R)f” fn(a)da = (R)f” lim fu(w)dx. (P3.12) 


n+ CO 


Let A be a subset of [a,b] whose closure is negligible w.r.t. 
Borel - Lebesgue measure X. Is the function g = 14 Riemann 
integrable on [a,b]? If it is Riemann integrable, then calculate 


(Rf. g(a)dx. 


Let {f,} be a sequence of bounded, Borel measurable, 
nonnegative functions on A C R. Suppose (L) [ f,dA — 0 for 
A 


n — oo. Is it true that f,, — 0 A-a.e. on A? 
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L(f,P), lower Darboux sum, 399 

U(f,P), upper Darboux sum, 399 

step function, 400 

S((a, b], B({a, b]), IR), space of step functions, 400 
mesh of a partition, 400 

canonic chain of partitions, 400 

upper Darboux integral, 400 

lower Darboux integral, 400 

Riemann integral, 401 

Riemann integrable function, 401 

criterion of Lebesgue integrability, 401 

upper Baire function, 402 

lower Baire functions, 402 

Cauchy sum, 402 

Cauchy integrable function, 402 

Dirichlet function, 403 

Lebesgue's Theorem of Riemann integrability, 403 
Henstock, Ralph, 406 

Kurzweil, Jaroslav, 406 

Denjoy, Arnaud, 406 

Perron, Oscar, 406 

gauge integral, 406 

Henstock-Kurzweil integral, 406 

gauge function, 406 

improper Riemann integral, 407, 409 

direct Riemann integral, 408 

direct Riemann integrability, 408 

Dirichlet's criterion, 411 

(L1({a, b], B({a, b]), A),]| - ||), L1-seminormed space, 414 
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4. INTEGRATION WITH RESPECT TO IMAGE 
MEASURES 


In this short section we challenge the claim (on a small scale though) 
that the Riemann integral has an overwhelming variety of techniques 
making it superior to the Lebesgue integral. Besides, we focus on the 
abstract integral. As one of the extensions of major integration 
techniques, we study integration with respect to image measure j.F* 
(where F' is a measurable mapping), under the nickname change of 
variables, as it resembles the prominent method for the Riemann 
integral. A more specific approach to a change of variables for 
Lebesgue integrals in n-dimensional Euclidean spaces is treated 
separately in [Dsh2]. 


4.1 Theorem (Change of Variables). Let (Qo, ’o, 4) be a mea- 
sure space, f €C~'(Q, 2), and F: (Qo, Xo) — (Q, X’) be a meas- 
urable map |such that .F* is an image measure on the measurable 
space (Q, &”)|. Then, the following formula holds true. 


Sf(w) duF*(w) = J fo F(wo)du(w). (4.1) 


Specifically, if f = g14, where AE XY and g€C1(Q,), then 
(4.1) reduces to 


Jow)duF*(w) = f go F(wo) du(wo). (4.1a) 
A peta) 


Proof. 


(i) Let s € W,(Q, »’) be just an indicator function s = 14. By 
Problem 3.7, Chapter 1, we have that 


so F(wo) = 14 0 F(wo) = 14) (0). 


Therefore, 


f14 0 F(wo) du(wo) = f1r+(4) (wo) dp(wo) 


= p(F*(A)) = wF*(A) = fla(w)duF*(w). 
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(ii) Let s be a nonnegative simple function with the representa- 
tion, 


s=) aj)1a,,a; € Ry. 
i=l 
Then, 
so F(wo) = (Dats ) o F'(w) 
i=l 
= Vial, 0 F(wo) = laid (4, (wo) 

i=l i=l 

and 


fsoFdp= dai WE" (Ai) = [sap 


(it7) Let f € C{'(Q, X’). Then there exists {s,}? CW, such 
that f = sup{s,,}. For s,, we have, according to (71): 


js, 0F dp = [s,dpF". 
Observe that {s,,0 F} CW, (Qo, Xo) and, by Proposition 5.6(iv), 
sup{s, oF} = fo F € Cz! (No, 2). 
Therefore, we have that 
ffoF dw =sup{ fs,oF dy} 


= sup{ |s,ank*) = [fF dak. 


(iv) Let f €C1(Q, X’). Then, f = ft — f~ and, according to 
Problem 4.1, 


(foF)+ =ftoF and (foF) = fo oF. 


Therefore, 
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foF =(foF)+-(foF) =ftoF-f-oF, 
and this, along with (222), implies that 
SfoFdu = f(foF)*du— f(foF) du 
=fftoFdu-— ff -oFdp 


= [ftduF* — [f-oF*dp = ff duF*. 


(v) Let f =gl4 where A€ XY and g €C71(, »’). Then we 
have 


fodpF* = fgladuF* = [(goF)(140F)du 
A 


F(A) 


4.2 Corollary. Let (Q, 2’, 4) be a measure space and let 
PF: (Q, 37) > (Q, Z) 


be a bijective transformation that is 3/-) measurable along with its 
inverse F*. Then, for each f €C71(Q,%), the following formula 
holds true. 


[ fdu= ffoF*duF*. (4.2) 
F*(A) A 


(See Problem 4.2.) 


4.3 Example. Let (9, ’,P) be a probability space and let 
X €C71(Q, X) be ar.v.. Recall that X induces the image measure 
PX*, or, equivalently, the probability distribution on the measurable 
space (iR,B), thereby generating the new probability space 
(R, B, PX*). The functional of X, [ XP, was called [in Definition 
1.10(iv)] the expectation of the r.v. X and denoted by symbol EX. 
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Let g € C'(R, B). Then, go X is also a r.v. whose expectation can 
be expressed as 


Elgo X] = [go X(w) P(dw). 


By formula (4.1), we have 


Elgo X] = fo(x) PX*(dz). (4.3) 


Specifically, if g(x) =x, we have EX = [aPX*(dz). If g = 1a, 
then from (4.3), Notation 1.13, and Definition 4.2, Chapter 5, 


E[14 0X] = f PX*(dr) = PX*(A) = P{X € A}. (43a) 
A 


O 


PROBLEMS 

4.1 Show that (foF)t = ftoF and(foF) =f ~ 

4.2 Prove Corollary 4.2: Let (Q, 2’, 4) be a measure space and let 
F:(Q, 37) — (Q, 2’) 

be a bijective transformation which is 37-3) measurable along with 


its inverse F*. Then, for each f € C-+(Q, X), the following formula 
holds true. 


{ fdu= ffoF*duF*. (4.2) 
F*(A) A 


4.3 Let f €C1(R”, B"), aE R, b € R", and B € B”. Show that 


J flaw +d)A(da) = pp [f(w)A(de 
B A 


where A=aB+b. 
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4.3 Simplify f f(e’")A(dz), where f €C'(R,B;R) and 
A 
A=[L,2]. 


4.4 Use the change of variables formula to evaluate the integral 
J f(2% + 1)X(dx), where 
A 


gz’, 2rEQ 
coo, «rEQ 


fia) = { 


and A = [1,3]. 
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NEW TERMS: 


change of variables formula, 416 

change of variables for a bijective transformation, 418 
expectation of a random variable, 418 

expectation of a function of a random variable, 419 


change of variables formula with respect to the affine transformation, 
419 
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5. POSITIVE MEASURES GENERATED BY 
INTEGRALS. ABSOLUTE CONTINUITY 


In this section we learn that the integral [ fdj, as a set function, say 
A 


v(A), turns out to be a measure. Hence, the two measures, ju (the 
original measure) and v (generated by the integral), are related 
through the given integrand-function f, which is referred to as a 
density. Conversely, under what condition imposed on two arbitrary 
given measures can a density function (that binds them through the 
integral) exist? The question raised leads to one of the central results 
in measure theory and integration, known as the Radon - Nikodym 
theorem, which specifies exactly that condition. 


This section gives a very brief and informal acquaintance with 
the Radon-Nikodym theorem and its ramifications needed to 
advance to the upcoming material and serving as an introduction. A 
more elaborated and general version of the Radon - Nikodym theo- 
rem is treated in Section 8. 


Let (Q, 4’, 1) be a measure space. Consider the integral 
A- ffdu=ffladu 
A 


as a set function on 2’. If f > 0, then as the following proposition 
asserts, we have a measure on 2’. 


5.1 Proposition. Let (Q, 5’, ~) be a measure space and let f € 
C,'(Q, 2’). Then, the set function v(A) = f[ fd is a measure on 
A 
pa O 
(See Problem 5.1.) 


5.2 Definition. According to Proposition 5.1, v is a measure 
generated by integral { f dj; v is also called the indefinite integral 
of f with respect to i. The function f is called a (Radon - Nikodym) 
density function of v relative to 1. O 


The following examples motivate the Radon - Nikodym theorem. 
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5.3 Example. 


(i) Let (Q, 2,4) be a measure space and let yw be o-finite. 
Then, there exists a sequence {A,} 1 Q such that u(A,,) < oo. 
Define the sequence {a,,} C R;\{0} as 


eae 1 1 = 
An = min{ WA,)a" > 3 \, MS 2 ewe 3 
Let 


Yoajly,. 


In = Yala and C= sup{ gn } = 
i=] i=l 


Then, 
Jgodu =sup{ fondu} = dait(Ai) <Vigal 


i=1 


Therefore, if j: is o-finite, there always exists a positive element 
g from L1(0, X,). Conversely, let g > 0 and ge L1(Q, 2, p). 
Then 


and gn > 1,,. Thus 
U(An) = Jla,du = nfgdu < Oo, 


which implies that p(A,,) < oo. Since g > 0, it follows that A, ft Q. 


(it) We have shown that o-finiteness of j1 is equivalent to the 
existence of a positive integrable function g. In other words, there is 
a positive “Radon -Nikodym density” g such that the measure v 
generated by the integral is finite. Another noteworthy observation is 
that if 


O=0(A) = foladu, 


then, because g14 > 0, by Lemma 1.15, gly € [0],,. In as much as 
g>0, AEN, ie., from v(A) = 0 it follows that (A) = 0. 
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On the other hand, should ju(A) = 0, then (due to the convention 
0- co =0) gly € [0],, and v(A) = 0. Thus, (A) = 0 if and only if 
(A) = 0. In other words, v and py possess the same null sets. 


However, if g is just nonnegative, v(A) = 0 does not necessarily 
imply that (A) = 0. But from (A) = 0, it follows anyway that 
v(A) = 0. O 


We saw two different ways when measures and densities are 
related. Firstly, given a nonnegative measurable function f and a 
measure j we have measure v = [ fd. Secondly, in the above 
examples, given a o-finite measure ju, we designed an integrable 
function g > 0 such that v = [ gdj was a measure. In both cases, a 
ji-null set is also a v-null set, in notationy < pu. 


In general, if 1 and js are two measures such that v < pu does a 
density function f of v relative to 4 always exist? The answer is 
given in the Radon - Nikodym theorem below. 


5.4 Definition. Let ,, and v be two measures on a measure space 
(Q, 37). The measure v is called (absolutely) continuous with respect 
to ys if every s-null set is also a v-null set. In this case, we write 
v <p. Any Borel measure continuous with respect to the Lebesgue 
measure is just called continuous. O 


The use of the word “continuity” is due to the following propo- 
sition. 

5.5 Proposition. Let v be a finite measure on (Q, 2’) and let ju 
be another measure on (Q, 5). Then the following are equivalent: 


(A) v< yp. 

(B) For all ¢ >0, there is 6 >0, such that for each A € X 
with (A) < 6, the inequality v(A) < « holds. 

Proof. 


(i) Suppose statement (B) is true. Choose an ¢. Denote by A 
the set of all A € 2’, for which (A) < 6. Then NV, C A (where Ni, 
denotes the subset of all ju-null sets). Then, for all N € Ni, 
0 = uw(N) <6 and v(N) < «. Because € can be made arbitrarily 


small, we conclude that v(N) = 0 and thus v < pu. 
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(ii) Suppose now that statement (B) is not true. That means, for 
some ¢ > 0 and for any 6 > 0 there is a set A(5) € »’ such that 
p(A(d)) <6 inpiics that v(A(6)) > ¢. We now define the sequence 
of 6's as 6, = x , N=1,2,..., and construct the corresponding 
sequence of A's such that A(é,,) = A, with the above property, that 
is, {A,,} is a 4-monotone decreasing sequence but “‘v-resistant.” Let 
A= lim A,. Then A C [J A,, and 


m=n 


u(A) < u( U U | Am) 25 WA) = Siete. 


Therefore, 4(A) = 0. However, by Problem 2.5, because v is finite, 
v(A) = v(lim A,) > limy(A,) > ¢ > 0 


and thus v is not ;1-continuous. Hence (A) is not true either. O 


The most general version of the celebrated Radon - Nikodym 
theorem was proved by the Pole Otto Nikodym in his paper, Sur une 
generalisation des integrales de M.J. Radon of 1930. Another 
prominent Pole, Stanislav Saks, suggested the name of this theorem, 
perhaps meaning Nikodym's theorem on Radon integrals, although 
Radon himself proved a much more special case. 


The idea of Radon - Nikodym's result had its inception in an 1884 
paper by Thomas Stieltjes, in which he introduced the new concept 
of a density function in connection with his famous “Stieltjes 
integral” (in its present version known as the Riemann - Stieltjes 
integral) and initially applied to very restricted classes of functions. 
In 1909, Frigyes Riesz proved in his widely referred to representation 
theorem that Stieltjes integrals are represented by the most general 
continuous linear functionals on [a,b] (whose more general version 
is explored in [Dsh2]). 


Riesz's result yielded many generalizations, of which the most 
fruitful was by Johann Radon [Rad] in his 1913 paper, “Theorie und 
Anwendungen der absolut additiven Mengenfunktionen.” In this 
paper, Radon, combining the ideas of Lebesgue and Riesz, intro- 
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duced an integral with respect to Borel measures on the Borel o- 
algebra in IR” rather than the Borel-Lebesgue measure used by 
Lebesgue. Among other things, Radon showed the existence of a 
Radon - Nikodym density function with respect to this integral as an 
absolute continuous measure relative to the Borel- Lebesgue meas- 
ure, significantly generalizing the earlier theorem by Lebesgue about 
the existence of an almost everywhere differentiable density. 


Right after the appearance of Radon's paper, Maurice Fréchet 
noticed that Radon's result can be generalized for arbitrary measures, 
rather than Borel measures on B(R"). This led Nikodym to his 1930 
generalization of Radon's theorem in the form very close to the pre- 
sent version. Consequently, a significant gap in integral theory exis- 
ted between 1913 and 1930. Soon thereafter, in 1933, Nikodym's ge- 
neralization led to the birth of measure-theoretic probability theory 
(in Andrey Kolmogorov's famous monograph, Grundbegriffe der 
Wahrscheinlichkeitsrechnung), the concept of conditional expecta- 
tion, and an introduction to the theory of stochastic processes. Still, 
many consider Radon to be the father of the modern theory of 
integration. 


Otto Nikodym, who is at the heart of one of the most significant 
results ever made in mathematics, was born on August 13, 1887, in 
eastern Poland, then belonging to the Russian empire. In 1919 he 
was among 16 mathematicians who founded the Polish Mathematical 
Society. Shortly after World War I, Nikodym's family moved to 
Belgium and then to France, where Nikodym was invited by the 
Institute of H. Poincaré to work on the mathematical foundations of 
quantum mechanics. (He published his results in numerous papers, 
and his monograph, The Mathematical Apparatus for Quantum 
Theories, was published by Springer-Verlag in 1966.) In 1948, he 
accepted a position in the United States at Kenyon College, Gambier, 
Ohio, where he stayed until his retirement. He died in 1974. 


Johann Radon was born in Bohemia, Austria-Hungary, now 
Czech Republic, on December 16, 1887. He received his doctoral 
degree at the University of Vienna in 1910. Besides his best-known 
work on integration, Radon applied the calculus of variations to 
differential geometry that led to applications in number theory. He 
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discovered curves which are now called Radon curves. Next he 
worked on convex functions and sets and on the determination of 
functions from the values of their integrals on certain manifolds that 
contain the Radon transform and play an important role today, 
especially in medicine and geophysics. His wide interests also 
brought him to Riemannian geometry and geometrical problems that 
arose in the study of relativity. 


Between 1919 and 1955 Radon had numerous appointments at 
Austrian and German universities. (In 1954-1955, he served as 
president of the University of Vienna.) In 1947, Radon became a full 
member of the Austrian Academy of Sciences. From 1948 to 1950, 
he was the president of the Austrian Mathematical Society. He died 
in Vienna in May 1956. 


We introduce some preliminaries on the Radon-Nikodym 
Theorem (further embellished in Section 8). 


5.6 Notation. Let It = MQ, a’) be the set of all (positive) 
measures on ((2, 5’). For a fixed measure pp € IM, denote a = 
{v € M:v < p}. (This set is not empty, because ps € Mis.) Define 
on C71 (Q, 3’) = C1 (Q, ©; R,, B(R,)) a mapping J, Such that for 


each f € Cj}, 


If =Sfdu=v(-). 
() 


By Lemma 1.15 [for f €C{'(Q, 2), ffduw=0 if and only if 
f € [0]). In particular, whenever (A) = 0, laf € [0 (under the 
convention that 0-00 = 0) + v(A) = f1lafdu = 0], we have that 
for each measurable function f > 0 and for each measure js on »’, 
not only is { fds a measure, but it is also continuous with respect to 
() 
jt. Thus, 
ti °C. (Qa) 


is an into map. O 


Now the Radon-Nikodym theorem asserts that if ~ is o-finite, 
for each v € IN, there exists a unique (up to the equivalence class 


He? 
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modulo j:) Radon -Nikodym density f € Cz'(Q, 2’) of v relative to 
jt. This needs some clarification: 


1) Given a function f € C7'(Q, 2), i, _f defines a measure that 


is absolutely continuous with respect to jz. As noticed above, this is 
done. Consequently, [C,'(9, ©’), Mts, f/,] is an into map. 


? 

2) Recall [Definitions and Remarks 5.7(i22), Chapter 5] that the 
ji-almost everywhere property of equality of measurable functions 
generates an equivalence relation E,, (the equivalence kernel of j1) on 
C~'(Q, 2; R) and thus on C7!(Q, X’), as a subset of C~1(Q, YR). 
Consequently, C;'(Q, 5’)|,, is a quotient subset of C'(Q, 47; R)|,, 
modulo E,,, also referred to as modulo ts. On the other hand, by 
Corollary 1.20 [If u is o-finite, f,g€ L(Q,X,u;R), and ffdy 

A 


pe 


= gdp, for each A € 3, then f = g pra.e. on QQ], the mapping Ji 
A 


generates E,, as its equivalence kernel. Then, by Theorem 4.4, 
Chapter 1, there is a unique function 


(eat, D1) |, = (Q, Di i; R R Vas Te i ik 


such that 


ti ~ Si. ° TE: 


where 7 stands for the projection of C,'(Q, =’) on its quotient 
C,*(Q, ¥)|,, by E,. (See Section 4, Chapter 1.) 


Therefore, J, , literally turns to injective map Si (the reducer of 


i. ) that now operates on the quotient set L(Q, 2’, ;R+) | :.. 


3) The major claim (existence) of the Radon - Nikodym theorem 
is that the mapping [C,'(Q, ’)|,,, MUS, J/,] is also surjective. In 
other words, for each measure v € a) ae (i.e., absolutely continuous 
with respect to j1), there is an equivalence class [f],, of Radon - 


Nikodym densities of v relative to j. 


A compact version of the above arguments is as follows. 
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5.7 Theorem (Radon-Nikodym). Let  € WN(Q, X’) be a o- 
finite measure. Then [C,*(Q, ©)| ,, IVS fi | is a bijective map. 


wr? 


As mentioned, the uniqueness of the Radon-Nikodym density 
class is due to Corollary 1.20. The rest of the proof of Theorem 5.7 
(existence) is rendered in Section 8, for more general classes of 
signed measures. O 


By Radon-Nikodym's Theorem, the map I is therefore in- 
veavle and its inverse, denoted by symbol 7 , 1s also a map valued 
itr ae ia (Q, X')|,,. Thus, for any v em<, 


equivalence class [ f],, of Radon - Nikodym densities of v relative to 
jy. and, for a fixed v € INS, we write 


there is a nonempty 


fu? 


ie = fa = Si Av}) 


and call it the Radon - Nikodym derivative of measure v relative to 
the measure 1. It should be clear that, unlike a Radon -Nikodym 
density, the Radon - Nikodym derivative “ is a j1-equivalence class 


ds 


of all Radon - Nikodym densities with respect to measure v. 
5.8 Examples. 


(i) Let Q be an uncountable set; let ©’ = {A € P(Q): either 
A or A® is countable}; and let v(A) =0 if A is countable and 
v(A) = oo if A® is countable. Let 4 be a counting measure on 3’ 
[ie., let (A) = |A| if A is finite]; otherwise, ~(A) = oo. Because 
the only p-null set available is @, it immediately follows that 
v < pt. However, we show that : cannot have a oy relative to ju. 
Assume the opposite. Let g € C7" be such that g ev . Then, for all 


w € 0), we have that: 


v({w}) = fo(a 
{w} 


= J1puy(w) g(a) u(dx) = f14. (x) g(w)u(da) 


w) J 1p (@) (da) = g(w)u(w) = gw). 
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This implies that g = 0, which, in turn, yields that v = 0. This is a 
contradiction. (For a further discussion see Problem 5.3.) 


(it) Let €, be a point mass on (R",B). We are interested in 
whether ¢, < A. Let B= {a}¢B. Then, \({a}) =0 whereas 
Eq({a}) = 1. Therefore, by the Radon - Nikodym theorem, ¢, does 
not have any density relative to A. 


(iii) Let A be the Borel - Lebesgue measure on (IR, B) and let v 
be a probability measure on the same measurable space such that 
v <A. Because is o-finite, by the Radon-Nikodym theorem, 
there exists a density, f € S called the probability density of v, 
such that v = { fd. Should v be a probability distribution, say 
PX*, induced by a random variable X € C~1(Q, 3’) that is absolute- 
ly continuous with respect to the Borel - Lebesgue measure, then f is 
referred to as the probability density function (pdf) of X. 


Let X: (Q, 57) — (R,B) be a random variable with PX* = 
| f(a,o7) dX, where 


ad 


f(a07;@) = | 2 exp{ _ wh, LE 


270 


Then, X is called a normal random variable with parameters (a, 
o”); the corresponding probability density function of X, f(a,o7) 
is called the normal density. O 


We discuss some more results related to the Radon - Nikodym 
theorem. 


5.9 Definition. Let 1; and j12 be two measures on a measurable 
space (Q, %’). ju; 1s said to be singular relative to {12 (or orthogonal 
to [t2) if there is an A € »’ such that 


(A) = p2(A°) = 0. 


In this case we write juy L jo. O 


It should be clear that the orthogonality relation is symmetric. 
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5.10 Examples. 


(i) Let €, be a point mass on (R”,B) with a € R”, and let 
A= {a}*. Then, A({a}) =e,(A) =0. Therefore, e, 12. In 
Example 5.10(iz), it was shown that €, is not absolutely continuous 
relative to 1. Now we have established another relation between €, 
and X. 


(i7) Let pp =ceg+bdA, where c,b € R,\{O}. It is clear that 
cég LX and bA < A, which implies that measure js is a sum of 
continuous and singular components relative to \. In the general 
case, if js and v are measures on (22, »’) such that v is o-finite, there 
is a unique decomposition of v = vy, + v,, where v, is an absolute 
continuous measure relative to js and vy, is a singular measure 
relative to yu. This fact is due to the well-known Lebesgue decom- 
position theorem discussed in [Dsh2]. O 


PROBLEMS 


5.1 Prove Proposition 5.1. Let (Q, 3’, 4) be a measure space and 
let f € Cz'(Q, X’). Then, the set function v(A) = f fd isa 
A 


measure on 3). 


5.2 Let «4 and v be measures on (2, X’) such that v < w and 
suppose js is o-finite. Show that there exists a Radon- 
Nikodym density f € a , such that O< f <1. 


5.3 Let js and v be measures on (2, »’) such that v < yu and let v 


be finite and g € ae Denote A = {w € 2: g(w) 4 0}. Show 


that the restriction of on X’ 1 A is o-finite. Give an example 
where (A) is not finite. 


5.4 Let 7, be a Poisson measure on (R, B). Investigate whether 7, 
is absolutely continuous or singular relative to X. 
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5.6 


3.7 


5.8 
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5.10. 


5.11 


5.12 
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Let 1, p2, and ys be measures on (Q, »’) such that p; L js and 
p2 L ys. Show that p; + po L pu. 


Let p; < joand p2 L p. Show that p,; L po. 
Prove that p < yu and p | ys imply that p = 0. 


Let js and v be o-finite measures on (Q, »’). Show that ju and v 
possess densities f and g, respectively, relative to p = + v. 


Is orthogonality transitive? 


Let f € L(Q, XY, uw; R,,B(R,)), ps be a positive finite mea- 
sure, iy C XS’ bea sub- -o-algebra, and juo = Resx,/s. Suppose 
= {0,0, A,B = A‘} where A is a measurable set. Define 


I= aa motay | Patol + Ho (B jaiBy | fAH01.- 


Show that g is a Radon-Nikodym density of v = f fd with 
respect to jug. 


Let X be a nonnegative random variable on a probability space 
(Q, 3’, P) such that X = c P-a.s. (a constant). Is PX* < X? 
If it is, give a Radon - Nikodym density of P_X* w.r.t. A. 


Prove that yz | p if and only if uA p = 8. 
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density, 422 

measure generated by an integral, 422 

indefinite integral, 422 

Radon - Nikodym density, 422 

absolutely continuous measure, 424 

v < 4, means v is absolutely continuous with respect to pu, 424 

continuous measure, 424 

Nikodym, Otto, 425 

Saks, Stanislav, 425 

Stieltjes, Thomas, 425 

Riemann - Stieltjes integral, 425 

Riesz, Frigyes, 425 

Radon, Johann, 425 

Fréchet Maurice, 426 

Kolmogorov, Andrey, 426 

MO, X’), the set of all measures on (2, d’), 427 

MS = {v € M:v < p}, the set of all measures continuous relative 
to pt, 427 

J,» mapping defined on C-'(0, YR, B(R,), 427 

[CoQ 2), = LQ, 2, ws Ry), DGS 


pe? 


J,,], bijective map between 


Radon - Nikodym derivatives and continuous measures, 428 
Radon - Nikodym theorem, 429 
ia = ie reducer of ie 429 
Radon - Nos density, 429 


et =f, ‘v) = J, {v}), Radon - Nikodym derivative, 429 


probability density, 430 
probability density function, 430 
normal random oe 430 


f(a,0%;2) = hs exp{ - 
function, 430 

singularity of a measures, 430 

orthogonal measures, 430 


ee 


\, normal probability density 
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6. SIGNED MEASURES 


The situation below is motivational for the study of a more general 
class of set functions than those we called “measures.” Let (Q, 2’, i) 


be a measure space and let f € L'(0,2,;IR). Define the 
following set function on >’. 


v(A) = J fdu = [1(A) — f(A), 
where 


[(A) eae and 12(A) = fF ae 


It is easy to show that the set function v has all the properties of a 
measure except for being positive. [See Problem 6.1 for a more 
general case of f € L(Q, X’, uw; R, B(R)).] However, in the above 
decomposition v = ju; — [2, the set function v is represented by the 
difference of two measures. We study this type of set function, which 
we call a signed measure. We give a formal definition below, 
without saying anything about a decomposition that follows. 


6.1 Definition. Let (Q, »’) be a measurable space. A set function 


vy: 3) — Ris called a signed measure if: 


a) v(@) = 0, 
b) for each A € 3’, the value of v(A) is well 
defined, i.e., it is either finite or --oo or —ox, 


c) v is o-additive. 


To tell signed measures from nonnegative measures, we refer to 
the latter as positive measures. G(Q, 2’) denotes the set of all signed 
measures on the measurable space (Q, »’). O 


6.2 Remark. If a sequence of pairwise disjoint measurable sets 
{A,} is such that its union is finite under v [ie., |v(U°,A,)| < 
oo], then o-additivity of v fie. |v(°°,An)| = [9 v(An)]] 
implies that the series U°° ,1(A,,) must be absolutely convergent. 
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This, as we know, does not need to hold for series in the general 
case. The reader is encouraged to explain it. (See Problem 6.3.) O 


6.3 Definitions. 


(i) The signed measure is called finite if its range is a subset of 
R. Otherwise, is called infinite. The triple (0, X’,v) is called the 
signed measure space. According to the type of the signed measure, 
the signed measure space is referred to as finite or infinite. The 
signed measure v is called o-finite if %’ admits a countable 
measurable partition {Q,,,} of v-finite sets. 


(22) Sometimes, we need a notion of a finite set under v (or av- 
finite set or just a finite set). This refers to a measurable set A with 
|v(A)| < co. 

(iit) A measurable set P is called v-positive (or just positive) if 
v(P A) € [0, oo} for all A € ©’. A measurable set N is called v- 
negative (or just negative) if vUN 1 A) € | —o0, 0] for all AE XY. 
Obviously, P (JV) is positive (negative) if and only if for any meas- 
urable subset FE of P (N), v(E) > 0( < 0). 


We denote by 58 C » the family of all positive sets under v and 
by Nt C » the family of all negative sets under v. 


(iv) A set function v: 3’ — R is called continuous from below 
if for every monotone nondecreasing sequence {A,,} 7 C » it holds 
that 


Jim,,v(An) =» (U An): 


If the above property holds for a particular monotone nondecreasing 
sequence {A,,}, then v is called continuous from below on {A,}. 


(v) Let {A,} be a monotone nonincreasing sequence of sets 
from ) at least one of which is v-finite. A set function v: ©’ > R is 
said to be continuous from above on {A,,} if 


jim, ¥(An) = (A () An). (6.3) 
n=1 
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The set function v is continuous from above on 2’, if (6.3) holds for 
every monotone nonincreasing sequence {A,}| C »’ that has at 
least one v-finite set. In particular, if {A,,} | @, (6.3) reduces to 


jim (An) = 0 
and this is referred to as continuity from above at the empty set or, 
shortly, @-continuity of v. 


(vi) Any signed measure on the Borel o-algebra is called a 
signed Borel measure. In particular, a signed Borel measure on 
(R",B), finite on d.-bounded Borel sets, is said to be a signed Borel 
- Lebesgue - Stieltjes measure. O 


6.4 Example. Let 2 = [- 1,1], © = B(R)N[- 1,1], and let 
us define vy on SX as v(A) = XA(AN(0,1]) —A(AN[-1,0]). 
Obviously, v(@) = 0 and 


Vd onntAn) = Dinar {A(An O [0, 1) — ACAn [= 1,0) }. 
v(An) 
Thus, v is o-additive. Furthermore, v is finite. im) 
6.5 Remarks. 


(i) Notice that if A is a finite set under v, then any finite or 
countable decomposition of A consists of finite subsets under v, that 
is not as obvious as for positive measures. (See also Problem 6.6.) 


(it) Unlike any positive measure, a signed measure fails to be 
monotone. (See Problem 6.5.) Yet some weak form of monotonicity 
applies to signed measures. (See Problems 6.2 and 6.6.) 


(iit) We observe two obvious and noteworthy properties of 
positive and negative sets. Let P € $B and N € MN (be positive and 
negative sets). Then, 


(a) P(P)NY CH and P(N) NX CM (where P stands for 
the power set). Indeed, if A C P, then (A) > 0. If B C A, then B 
is also a subset of P and thus is positive. 
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(b) If Pe Band N EN, then PN N is a null set. Indeed, as 
a subset of both P and N, PMN must be simultaneously positive 
and negative. O 


We continue with a few more introductory properties of signed 
measures. 


6.6 Proposition. Let (Q, 27,1) be a signed measure space. If A 
and B are measurable sets such that B is v-finite and A C B, then 
A is also v-finite and v(B\ A) = v(B) — V(A). 


6.7 Proposition. Any signed measure v is continuous from 
below. 


6.8 Proposition. Let (Q, X’,v) be a signed measure space and 
let {A;,} be a monotone nonincreasing sequence of measurable sets 
such that for some n, v(A,,) is finite. Then signed measure v is con- 
tinuous from above on {A,}. 


6.9 Proposition. Let v be a finitely additive set function on the 
measurable space (Q, 5’) such that v(O) = 0. If v is continuous 
from below, then it is o-additive. If v is finite, then the continuity 
from above implies that v is o-additive. If v is finite and continuous 
at the empty set, then v is o-additive. 


(The proofs to Propositions 6.6-6.9 are left for exercises; see 
Problems 6.6 - 6.9.) 


6.10 Lemma. Let (0, 5’,v) be a signed measure space and let 
Aé wD be such that —co < v(A) < 0. Then, there is a negative 
subset N of A such that —co < v(N) < V(A). 


Proof. If A is negative, i.e. it does not contain at least one subset 
E with (Ef) >0, then the statement of the lemma is proved. 
Otherwise, assume that A has at least one such subset and let 


s; = sup{v(C): C C A} € (0, oo. 


Now, if s; = oo, then there is a subset A; C A such that v(A) > 1 
because by Proposition 6.6, all subsets of A are of finite measure 
under v. If, however, s, is finite, then there is a subset A; C A such 
that v(A;) > 5S]. Combining both cases, we conclude that there is 
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an A; C A such that (A) > min{ $s, 1}. Now, consider the set 
A\ A, which has the following property by Proposition 6.6. 


v(A\A1) = v(A) — (Ai) < (A), 


because v(A) > 0. If A\ A; is negative and thus it does not contain 
any subset E with v(£) > 0, then we are done with the proof, with 
N = A\A\. Otherwise, we apply to A\ A, the same procedure as to 
A arriving at a subset Ay of A\A; with v(A2) > min{}59, 1} 
(where sy = sup{v(C) : C C A\A;}) and seeing set A\(A; + Ae) 
with property 


v(A\(A; + A2)) = v(A\A1) — v(A2) < v(A\A1) < v(A). 


If the process we are going through sees its nth step, we arrive at a 
subset A, C A\S~"' Aj, with v(A,) > min{ $s, 1}, where 


S, = sup{v(C): CC A\r™ Ai}, 


and if AYA is a negative set, we stop the algorithm. However, 


if we never happen to stop the process, we finally have the set 
N = A\Y An 
with the properties asserted in the lemma. Indeed: 
1) —co < »(N) = (A) — VOT An) < v(A) < 0 


2) Because v()->°, An) = D572, v(An) > 0 and finite, v(A,) 


n= 


— 0, and v(A,,) > min{s,5, 1} making s,, — 0. 


3) On the other hand, by the definition of s,, for each subset 

fm A\ yo Ai, V(En) < 8, — 0. Hence, each subset F C N 
must have v(F) < lim,_..05, = 0 making N negative. i 
The following theorem states that there is an (essentially unique) 
decomposition of set 22 into a positive and a negative set relative to a 
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given signed measure v. This decomposition, referred to as a Hahn 
decomposition, leads us to the forthcoming Jordan decomposition of 
vy into the difference of two positive measures mentioned in the 
beginning of this section. We start with the following. 


6.11 Definition. Let (Q, 4’) be a measure space. Given a signed 
measure v on 3’, suppose (2 can be partitioned into two sets, Q* and 
Q~ such that QT = P is a v-positive set and Q~ = N is a v-nega- 
tive set and Q = Qt + (-. Any such decomposition of 2) is called a 
Hahn decomposition of Q (with respect to v or rather induced by v), 
in notation (Q*,Q7~) or most commonly (P, NV). O 


The existence of such a decomposition of 2) is established in the 
following theorem. 


6.12 Theorem (The Hahn Decomposition Theorem). Let 
(Q,27,v) be a signed measure space. Then there exists a Hahn 
decomposition (P, N) of Q with respect to v. 


Proof. We assume, without loss of generality, that 1 does not 
take the value —oo. Otherwise, we consider — 1 instead. If @ is the 
only negative set of v, then for each Ac 3X’, then 1(A) > 0. 
Furthermore, if there were a set A such that »(A) < 0, then by 
Lemma 6.10, there is a nonempty negative subset of A. Therefore, 
(Q, @) is the trivial Hahn decomposition and we are done with the 
proof. 


Otherwise, suppose there is at least one negative set of v and 
then let 


I =inf{v(£): E € » and EF is v-negative}. 


Clearly, J < 0. Then, there is a sequence {N,,} of negative sets with 
lim, —.V(N;,) = I. Because of Problem 6.10, 


N:=U Na 


n=1 
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is also a negative set. Regarding B,, as LJ Nz, we have {B,,} as a 
k=1 

monotone nondecreasing sequence of negative sets | N and hence, 
by Proposition 6.7, lim, ..v(B,) = v(V). Furthermore, because 
Br\Nn © By and B,, is negative, v(B,\N,) < 0 and v(B,\Nn) = 
v(B,,) — v(N,,) thus implying that »(B,,) < vCN,,). The latter yields 
that v(N) < J. On the other hand, as for a negative set, v(V) > J, 
and thus combining the two inequalities we have vp(NV) = I. 


Now we show that P = N° is a v-positive set. If this were not 
the case, then there is at least one measurable subset A of P with 
—oo < (A) < 0 and then, by Lemma 6.10, there is a measurable, 
negative subset B of A with »(B) < v(A); in particular, v(B) < 0. 
Then, B+ N makes a negative set such that (B+ N)= 
v(B) + v(N) < v(N) = I, which contradicts the fact that J is the v- 
limit inferior of all negative sets. 


If v € [—co, co), as we mentioned in the beginning of the proof, 
we instead consider — v € (—0o, co] with the corresponding Hahn 
decomposition (P,N). Then, the Hahn decomposition of Q with 
respect to v is obviously (NV, P). O 


6.13 Theorem (Uniqueness of the Hahn decomposition). A 
Hahn decomposition is unique in the following sense. If there is 
another Hahn decomposition (P', N') then PAP’ and NAN' are 
v-null sets. O 


The uniqueness of the Hahn decomposition is left for an exercise 
(Problem 6.13). 


6.14 Notation. Because by Theorem 6.13, Hahn decompositions 
form a single equivalence class modulo v, we denote this class by 
symbol [H*, H~],, and call it the Hahn class (modulo v). O 


Although any Hahn decomposition is a decomposition of 2 
(induced by a signed measure 1), the so-called “Jordan decomposi- 
tion” in the forthcoming discussion is the one of the signed measure 
itself. It states that each signed measure can be uniquely represented 
by the difference of two positive measures, which is kind of a 
converse of Problem 6.4. We start with the following. 
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6.15 Definition. Let (P, NV) € [H*, H],. Denote 
vt(-)=v(PN-) 
y-(-)= -—v(NN-) 
lv) =vt+v7 


and call all three of them the Jordan variations, of which v* is the 


positive, v~ is negative, and |v| is total the variation relative to 
(P,N). Oo 


6.16 Remarks. 


(i) Obviously v* and v~ are positive measures and as the 
result, so is |v]. 


(it) v =v* —v-, which can be readily shown. 


(iit) As we show in Theorem 6.17, Jordan variations are 
invariant of any Hahn decomposition from [H*, H™],. O 


6.17 Theorem. Any two sets of Jordan variations relative to two 
Hahn decompositions from the same Hahn class |H*,H~|,, are 
identical. 


Proof. Let (P, N),(P’, N’) € |[H*, H”],. To prove the theorem 
we need to show that for each FE’ € »’, 


VPN E)=v(P'NE) and v(NNE)=v(N'NE). 
From P = (PN P’) + (P\P’) we have 
PRE=(POP VE) HPP one. 


Because by the uniqueness of the Hahn decomposition PAP’ is a 
null set, so is P\ P’ and so is (P\P’) N E. Thus, 


WPOP NE) Huns). 
Interchanging the roles of P and P’ we have 
WP OP V2) HP" 12): 
The same arguments yield that v(N QE) =v(N’OE). O 
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6.18 Remarks and Examples. 


(i) In light of Theorem 6.17 we need to modify Definition 6.16 
about Jordan variations, in as much as they no longer depend upon 
any particular Hahn decomposition. In the sequel, we omit the phrase 
“relative to (P, N)” from Definition 6.16. 


(it) The pair (v*,v~) of positive and negative Jordan varia- 
tions, as noticed restore v = vt —v~. The latter representation is 
called the Jordan decomposition of a signed measure v. According 
to Theorem 6.17, this decomposition is unique. 


(iit) Again, by Theorem 6.12, there is a unique Hahn class 
[H*,H~], induced by a signed measure v. This class, in turn, 
generates a unique Jordan decomposition. We wonder if this process 
can be reversed in some sense. For instance, according to Problem 
6.4, if ju; and j12 are any two positive measures on 2’, at least one of 
which is finite, then the difference v = ju; — [2 is a signed measure. 
Under what condition is j1; — p12 the Jordan decomposition of v? 
(Here we are talking about a practical utility of existence Theorem 
6.12.) 


We point out one property of the Jordan decomposition 
vy =v*t—v_, namely, that the positive and negative variations in 
the Jordan decomposition must be orthogonal in the same sense as 
that introduced in Section 5. So, we recall that two positive measures 
jt; and [2 are orthogonal if there is a measurable set A such that 
[1(A) = fa(A°) = 0. In the event of the Jordan positive and 
negative variations any Hahn pair (P, N) can represent (A°, A), 
namely, v*(N) = v~(P = N°) =0 and so they are “orthogonal 
with respect to (P,N).” Consequently, if we want to associate 
measures 1; and j12 above with a particular Hahn class [H*, H™|_,, 
we need to make sure that they are orthogonal (j1; L ju2) with respect 
to any (P, N) € [H*, H7],. 

For example, let €¢, be a Dirac point mass and \ the Borel - 
Lebesgue measure on B(R). Then, as we know, ¢, L A. For 
instance, A({a}) = €,({a}°) =0. Then, if we define the signed 
measure v asv = \—e, =v* —v (noticing that e, is finite), we 
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conclude that it is the Jordan decomposition. In this case, an 
associated Hahn pair is (P = {a}°, N = {a}). 


(iv) In another example, let 7, be a Poisson measure with 
parameter a defined as 7, = ie ee The Poisson measure 
Tq 1s also orthogonal to \. We can take P = Nj and see that A(P°) 
= T,(P) = 0. Thus, v = A — mq is the Jordan decomposition of v 
relative to (Nj,No). Now, there are other Hahn decompositions, 
besides (P, P°) from the class [H*, H~],,. For example, let positive 
and negative sets be P’ = {-1,0,1,2,...}© and N’= 
{—1,0,1,2,...}, respectively. Or P=Z¢ and N =Z. 


Yet another noteworthy Hahn pair is N=Cu Z, where C’ is 
the Cantor set (defined in Example 3.11, Chapter 5), and P=N _ 
Obviously, \(N ) = Ty (P) =), Here N , unlike other negative 
sets in the same context, is uncountable. im 

In a nutshell, the above discussion can be formalized as follows. 


6.19 Proposition. Let py, and jz be two positive measures on a 
measurable space (Q, 5’) such that pu, L p2 with respect to a de- 
composition Q = Q* +Q7 [more specifically, p4(Q7) = p2(Q*) 
= 0]. Then, [u, — 2 = v is a Jordan decomposition of v relative to 
(Q*,Q7) which is an element of Hahn class |H* , H™|,,. 


Proof. Let E* CO* MS. Because p2(QT) = 0, by monotoni- 
city of positive measures, fi2(E*) = 0. Therefore, v(E*) = 41 (E*) 
> 0 and thus * is a positive set relative to v. Similarly, Q~ is a 
negative set relative to v. O 


One of the very useful benefits of Proposition 6.19 arises in the 
integral representation of a signed measure that meets everyone's 
expectation that vt = [ftdu, v- = [f~ dw are positive and 
negative Jordan variations of the integral measure v = [ fd. We 
give more details on this in the following. 


6.20 Corollary. Let (Q, 2’, 1) be a positive measure space and 
let v be the indefinite integral v = {fd generated by 
f €L(Q, 2, u;R) and yw. Then, v is a signed measure and its 
Jordan variations are: 
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vt = fftdpv- =JSf- du and |v|= f\fldp, 


where P={7>0) M=1f <0) ond 7° =f, = fle 
Furthermore, a_ relevant Hahn pair (P,N) = (Qt,Q7)¢€ 
Bf oP =47 20) and N =17 <0}. 


The proof of this proposition is quite simple and is left for the 
reader as Problem 6.18. O 


Another significant property of the Hahn - Jordan decomposition 
is that it allows a more explicit representation of the Jordan varia- 
tions playing a pivotal role in the forthcoming applications. 


6.21 Proposition. Let (Q,7,v) be a signed measure space. 
Then the positive, negative, and total variations of v can be 
characterized as follows. Given any measurable set A € 3, 


(i) vt(A)=sup{V(E): Ee Yn A}. 


(ii) v-(A)=sup{ —(F£):E € SNA} = — inf{v(£): 
Ee SNA}. 
(iii) |v|(A) = sup{)¢y_, |v (Ex)|: {FA .-., Bn} C XY and 
1k CA}: 


Proof. Denote by (P,N) a Hahn decomposition of (2 with 
respect to v and let 


Veyp(A) = sup{v(E): E € 3 A} 
and 


Vint(A) = sup{—v(E): EF € 3M A} 
= ~inf{v(E): Ee SN A}. 


(i) Clearly, v*(A) = V(ANP) < Veup(A). To prove the in- 
verse inequality we notice that because (P, X’'M P,Ressppv) is a 
positive measure space, Ressqpy is monotone and hence, for each 
BEeESNA, 
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VE) = VW(ENP)+uebnNn) 
<u(EN P)<u(An P)=v*(A). 
This yields the required inverse inequality 
Vsup(A) < (A) 


and thereby proves part (2) of the proposition. 
(77) By interchanging the roles of P and N for — v, we have 


(-vy"(A) = -V(ANN) =v (A) 


and therefore v~ = —Vinr. 
(iit) Let (P,N) € [H*, H7], and let A € S’. Then, 
JV\(A) = VAN P)-vU(NNA) = |v(Ey)| + |v(2»)I, 
where FE, E> is a partition of A. Obviously, 


|v|(A) < sup{>>7_.|u(Ex)| : {Fa,-.., Bn} CX and 
tC Ay aay. (6.21) 


Let F,,..., E, be a disjoint measurable “subpartition” of A (i.e., a 
partition of a measurable subset of A). Then, 


VinilY (Ei) | = Veale (Ei) -— v(B,)| 
< Vile (BE) + v(B,)| = Vile |(Bi) 
= |" (oi Fi) < |v|(A). 
The latter inequality also holds for sup{}~‘"_, |v(E;)|} and thus, 


aa < |v|(A). (6.21a) 
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Inequalities (6.21) and (6.21a) imply that a4 = |v |(A). O 
6.22 Remarks. 


(i) To summarize the Hahn-Jordan decomposition, by Pro- 
position 6.21, we have that 


yt (A) = sup{v(E£): EF € 2 N A} =Vu(AN P), 


—vy-(A) = inf{v(E): Ee VN A} =v(ANN) 
and 


(A) = vt (A) — (A). 


These equations offer an obvious interpretation. Given a measurable 
set A, the signed measure 1 attains its maximum and minimum 
values on two measurable disjoint subsets of A: AN P and ANN, 
respectively; and the entire measure of A is the sum of these two 
values. In particular, it follows that P and N are the v-maximal and 
v-minimal subsets of 2 (in notation, P = S and N = J) on which v 
attains maximum and minimum values, respectively. This is due to 
the fact that (P, 2’ P, Ressqpv) is a positive measure space and 
hence Resyqpy is monotone. A similar argument explains why v 
attains a minimum value on NV. 


(it) Let (Q, 2’, vv) be a signed measure space. If v is a positive 
measure, then obviously, S = 9) and J = @, where S and J are intro- 
duced in (7). Consider the case with v = [ fdp, where p is a positive 
measure on (Q, ©’) and f € L1(Q, 2’, p). Then, 


ae J fdp+ Jf fap. 


AN{f20} AN{ <0} 
Therefore, 


f fap<v(A)< ff fap, VAEX, (6.22) 
{f<0} {f>0} 


and consequently, S = {f > 0} andJ = {f < 0}. 
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Inequality (6.22) is trivial in the sense that v(A) = v*(A) — 
v-(A) < v*(A) and v(A) > —v (A). All we need is to recognize 
the left- and the right-hand sides of (6.22) as the negative and 
positive Jordan variations of v. Corollary 6.20 helps interpret this 
context of (6.22). Indeed, we have that 


V(A) <v+(A) = (SNA) = fft1adp 
and that v-(A) = —v{IN A} = —f f-ladp > —v(A). 


(277) If js and p are two positive measures at least one of which is 
finite, the difference v = js — p is a signed measure. However, it is 
not the Jordan decomposition of v unless j: and p are orthogonal (as 
we learned from Proposition 6.19). Let (Q,2’,v) be a signed 
measure space. Denote by vg = ReSsqgv, where FE is a measurable 
set. To obtain the Jordan decomposition of v = 44 — p, we need any 
Hahn decomposition of 92 with respect to v. Say, (P, N) is one. 
Then, Jordan variations are 

vt = vp = Up — pp (6.22a) 
and 
Vy- = —Vy = pn — LIN. (6.22b) 


We can also make use of formulas of (2) and (22) of Proposition 6.21 
to determine the positive and negative variations. O 


6.23 Examples. 


(2) Let €9 be the point mass and P a probability measure on 
(R,B). We find a Hahn decomposition of the signed measure 
v = P — € . We show that J = {0} is a v-minimal (and negative) set 
in light of Remark 6.22(7). For an A € B, 


v(ANI*) =P(ANI*) >0, 


and either 
v(ANTI) = P({0}) — e9({0}) = P({0}) — 1, with OE A 


or 
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v(ANI) =0, with 0 ¢ A, 


which implies that v(ANJ) <0. Using relations (6.22a) and 
(6.22b), we have the Jordan decomposition of v: 


yt(A)=v(ANI®) =P(ANL*), 


and 


v~(A) = 14(0)0 — P({0}). 


Note that {0} = J is the set where v attains its minimum. 


(27) Let v = A — pt, where A is the Lebesgue measure on (R, B) 
and 4 is the geometric measure defined as 


= ee 0 A Lal oe 
Clearly, N = {1,2,...} is a negative set relative to v, whereas 


P = N°“ isa positive set. Thus, (P, NV) is a Hahn decomposition of 
R relative to v and, consequently, for every Borel set A, 


y+(A) =(A— w(AN {1,2,-..}) 
and 


y-(A) = (w— MAN {(1,2,...) 


represent the Jordan decomposition of v. Because N is a A-null set, 
the latter reduces to 


py (A) = pA nq 2ycx hk 


Therefore, v attains its minimum at N and its value is —1, whereas 
the maximum value of v is co and it is attained at N°. O 


6.24 Remark (Signed Measures as a Lattice). 


As we worked on positive measures in Chapter 5, Section 1, 
imposing in 9(Q, »’) some space structure, we similarly approach 
G(Q, 3’) as the set of all signed measures on a measurable space 
(Q, 7). It cannot be a linear space to begin with, in as much as the 
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sum of two signed measures need not be a signed measure, for 
obvious reasons that oo — oo (we can encounter) is not defined. 


However, we can structure a smaller set G,(Q, 2’) of finite 
signed measures by making it a vector space over field R and 
including @ as the zero measure. It can be easily shown (Problem 
6.23) that, unlike its predecessor IN(Q, 27), (6,(Q, X’), R, +, -) is 
indeed a vector space. Furthermore, in Chapter 1, Section 1, we 
showed that (9(Q, 5’), < ) was a lattice. We prove that similar 
lattice operations establish (G,(Q, 4’), = ) also as a lattice and not 
only that. O 


In the next two lemmas whose proofs we leave to the reader, we 
demonstrate that (G,(Q, 2’), < ) is indeed a lattice and further 
enhance it to a vector lattice associating the Jordan variations with 
the familiar lattice components including v*, v~, and modulus |v. 


6.25 Lemma. Let 1, 72 € G,(Q, 2’). Then the set function 
p(A) = sup{, (F) +12(A\E): Be 2m A} 
is a finite signed measure on /. 
(See Problem 6.19.) O 
6.26 Lemma. Let 11, 72 € G,(Q, 2’). Then the set function 
o(A) = inf{y,(E£) +1(A\E): Ee 2 A} 
is a finite signed measure on >). 
(See Problem 6.20.) 


O 
6.27 Remark. We introduce on G,({, X’) the partial order =< 
such that v x v if and only if (A) < v(A) forall A € &. O 
The theorem below states that (G,(Q, 3’), x ) is a lattice. 
6.28 Theorem. Let p and o be defined as in Lemmas 6.25 - 6.26. 


Then, p = 1 V V2 and 0 = 4, \ 2, and consequently, the partially 
ordered set (6,(Q, %’), ~X ) is a lattice. 


Proof. (i) Let U({11,2}) C G.(Q, X’) denote the set of all 
upper bounds of the two-element set {1,17}. We show that 
p €U({M4,12}). In other words, we show that for each A € ¥, 
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p(A) > 14(A) and p(A) > 12(A), in notation, p > 1 and p = 1%. 
Indeed, for any measurable subset E' C A, 


p(A) = (EF) + (A\E). 


Choosing E = A or E=@, we arrive at p > {1,12} and thus 
p € U({4, 2}). Similarly, it can be shown that the measure o from 
Lemma 6.24 belongs to the set L({1,v2}) of all lower bounds of 
{1, V2}, in notation, o < {14, V2}. 


(i7) Let we U({™,v2}) be another upper bound of the set 
{1,12}. We show that p x u, which validates that p is the least 
upper bound of {1,12}. Let E C A be two measurable sets. Then, 


v1(B) + )(A\B) < u(B) + u(A\E) = u(A) 
implying that p X u and that p = 1, V 1. 
Now, let 1 € £L({™, v2}). Similarly, 
I(A) = 1(F) + UA\E) < 14 (£) + (A\E) (6.28) 
holding for all E C A. Therefore, the inequality sustains for the 


infimum on the right of (6.27). Consequently, 1 < o and thus o is 
the greatest lower bound of {1, v2}. O 


6.29 Remark and Notation (Signed Measures as a Riesz 
Space). As G,.(Q, 4’) is authenticated as a vector lattice (or syno- 
nymously, Riesz space), we use the notation (G,(Q, 27),R, +, -, 
x ) as for the vector lattice carried by the set G,(Q, ’) of all finite 
signed measures. 


Furthermore, from Lemmas 6.25 and 6.26 and Theorem 6.28 we 
concluded that p and o defined as 


p(A) = sup{v;(E) + V2(A\ EF) :BELN A} 
o(A) = inf{v,(E) +1(A\E): Ee SNA} 


were p = 1, V vy and 0 = A v2. Compared to Proposition 6.21 
(i-i1), we see that v V0 =vt andvA@=v, whereas |v| = vt 
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+ v~, as per Definition 6.15 (called there the Jordan variations), 
are consistent with the associated lattice operations. 


Along this side, we can get an access to the whole reserve of 
properties made available for the general lattice operations from 
Chapter 1, Section 7, which no doubt is a useful asset in studying 


space (G.(Q, 2); R,+,°, 3 / O 
6.30 Definition. We further embellish the space structure of 
(6,(Q, X’), RR, +, -, X) by introducing a real nonnegative func- 
tion on G,(Q, 2”) by 
IIvl| = |v|(2), 
which we claim to be a norm. O 


6.31 Proposition (The Norm Properties). 


(@) |- || 20. 

(it) ||v|| = 0 ifand only ify = 8. 
(722) |lav|| = |a|||v]. 

(tv) |lvr + vel] < |r|] + |lvell- 


Proof. In all instances below we make use of properties of lattice 
operations from Chapter 1, Section 7. 


(i) Is obvious. Furthermore, by Problem 6.17, || - || is finite. 

(it) O= |v|\(Q) = v*(Q) + v7 (Q) = 0 implies that v = 8. 

(itz) |lav|| = jav|(Q) = |al|v|(Q), due to Problem 7.17(v7). 

(iv) Due to the triangle inequality for modulus |v; + 1| x 
|v1| + |v2| [Problem 7.18(v)], we have ||v1 + v2|| = |v1 + v2|(Q) < 
|v7,|(Q) + r2(Q). im 

The introduction of a norm on G,.(Q, »’) is fully rewarded in the 
following theorem. 


6.32 Theorem. The normed vector space (G,(Q, 5’), || - ||) is 


Banach. 
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Proof. Let {v,,} C G,(Q, 5’) be a Cauchy sequence. Then, for 
each € > 0, there is an N(e)-tail of {v,,}. Thus we have 


E> lei —_ Vr|| — |Vin — Vn | (Q) 
= kee — Un|(A) = |Vin(A) — U,»(A)|, (6.32) 


for each A € 5’, implying that {v,,} is a Cauchy sequence uniformly 
on +’. Therefore, for each A € 3’, {v,,(A)} is a Cauchy sequence of 
real numbers and as such, it is convergent to a finite real number, say 
v(A). Therefore, v is a set function on »’, to which the sequence 
{v, } converges setwise on »’. 


Furthermore, by fixing n in (6.32) and passing to the limit in m 
we get 


|v(A) — %,(A)| <e,n > N and forall A € &. (6.32a) 


We show that (7) v is a measure and (77) ||v — v,|| — 0. 
(i) Obviously, v(@) = 0. Now, let Aj,...,A% be disjoint 
measurable sets. Then, 


(SLi) = Litin-sooYn (eh) = oE limysooYn(Ax) 
= Wie (Aa)- 


Therefore, v is finitely additive. 
Next, we show that v is continuous at the empty set. Let { A, } | 


[o-e) 
C »’ such that (] A; = @. Due to the uniform convergence of {v,, } 
k=1 


on ¥) to v, as per (6.32a), for each € > 0, there is an N(e)-tail of 
{v,} such that |v,(A) — v(A)| <e,n > N. Now, by continuity at 
@, for each v,,, we have limz..¥,(A,) = 0. In other words, for 
each € > 0, there is an M such that for all k > M, |v,(Ax)| < 5. 
then, forallk > M andn> WN, 


|v(Ag)| < |v(Ak) — Yn(Ag)| + |Un(Ag)| < 
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This implies that v is @-continuous. By Proposition 6.9, then v is o- 
additive, as it meets all assumptions of Proposition 6.9, namely, v is 
finite, finitely additive, and continuous at the empty set. 


(it) We use Proposition 6.21 (2, ii) according to which 


; (vq —v)" (Q) = sup{(y, —v)(E): Ee 5} 


(vy, —Vv) (Q) = sup{ — (y% —v)(F): Be x} 
considering that v,, — v € G,(Q, 2’). Furthermore, using (6.32a), 
(Yn —V)(A) < |(% —v)(A)| Se 
as well as 
—(Yp, — v)(A) < |(%n — v)(A)| <e 
both holding for n > N and uniformly for all A € 2’. They in turn 
imply that 
(Un — v)*(Q) + (% — Vv) (Q) = || — vl < 2¢, 


good for all n > N and thereby the claimed convergence v,, — v in 
the norm. 


In conclusion, (G,(Q, »’), || - ||) isa complete NVS. O 


6.33 Remark. The space (G,(Q, »’),R, +,-, <,]||- ||) turns 
out to be richly endowed with a variety of structures and operations 
making it an awesome paradigm of an NVS, which is a lattice and 
Banach. We call it for short a Banach lattice. O 


PROBLEMS 


6.1 Show that the integral set function v introduced in the begin- 
ning of this section for the case of an f € L(Q, »’, ;R, B(R)) 
is a signed measure (i.e., v(@) = 0) for each A € 3’, (A) is 
either finite or +00 or —oo, and v is o-additive on »’. 
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6.2 


6.3 


6.4 


6.5 


6.6 


6.7 


6.8 


6.9 
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Let v be a signed measure on a measurable space (2, 5’) and 
let A be a finite measurable set. Show that if A = A, + Ao is 
any decomposition of A, then either A; and Ag are finite. 
Furthermore, show that if v(A) = oo, then at least one of the 
subsets A; or Ay must be positively infinite. If A = U°° | A, is 
a countable measurable decomposition of a finite set A, then 
all sets A,,'s are finite. In particular, it follows that any subset 


of a finite set must be finite. 


Why must the series U°°.,v(A,) in Remark 6.2 be absolutely 
convergent? 


Let ju; and jg be two positive measures on a measurable space 
(Q, 7) at least one of which is finite. Show that v = ju, — jug 
is a signed measure. 


Show by a counterexample that a signed measure is not mono- 
tone. 


Prove Proposition 6.6. Jf A and B are measurable sets such 
that B is v-finite and A C B, then A is also v-finite and 
v(B\ A) = v(B) — (A). 


Prove Proposition 6.7. Any signed measure v is continuous 
from below. 


Prove Proposition 6.8. Let (Q,2’,v) be a signed measure 
space and {A,,} be a monotone nonincreasing sequence of 
measurable sets such that for some n€ N, v(A,) is finite. 
Then the signed measure v is continuous from above on { A,}. 


Prove Proposition 6.9. Let v be a finitely additive set function 
on the measurable space (Q, 5’) such that v(®) = 0. If v is 
continuous from below, then it is o-additive. If v is finite, then 
the continuity from above implies that v is o-additive. If v is 
finite and continuous at the empty set, then v is o-additive. 
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6.10 


6.11 


6.12 


6.13 


6.14 


6.15 


6.16 


6.17 


6.18 


Let $8 and MN be the families of positive and negative sets, 
respectively. Show that $8 and Nt contain the empty set and 
they are closed with respect to differences and at most 
countable unions and intersections. 


Let 58 and MN be the families of positive and negative sets, 
respectively. Show that v is monotone on either family. 


Let (Q, 27,1) be a signed measure space and let A € X’ be 
such that 0 < v(A) < oo. Show that there is a positive subset 
P of A, with v(A) < u(P) < co. (Hint: Use a similar ap- 
proach as in the proof of Lemma 6.10.) 


Show that all Hahn decompositions form a unique equivalence 
class. 


Let v be a signed measure on (2, »’) represented by a diffe- 
rence of two positive measures v = jl; — lg. Show that 
fy > vt and p2>vV-. 


Show that any signed Borel - Lebesgue - Stieltjes measure on 
(R", B) is o-finite. 


Let (0,7,v) be a signed measure space. Show that 
|v|(A) = 0, for an A € S’, if and only if v(S) = 0 for each 
S € 3'N A. Show by example that 1(A) = 0 is not sufficient 
for |v|(A) = 0. 


Let v € G,(Q, 2’); that is, v is a finite signed measure. Is it 
true that v is bounded? 


Prove Corollary 6.20. Let (Q, 5,4) be a positive measure 
space and let v be the indefinite integral v = [ f djs generated 
by f € L(Q, XY, u;R) and yu. Then, v is a signed measure and 
its Jordan variations are: 


vt = fftdy,v- = ff dp and |v| = J\fldp, 
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6.19 


6.20 


6.21 


6.22 


6.23 
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where P={f >0}, M={f <0}, and f* = fip,f- = 
flys. Furthermore, a relevant Hahn pair (P, N) = (QT, Q7) 
elf" .H- |p Pat) SU) ond N= {Ff = 0, 


Prove Lemma 6.25. Let 1,12 € G,(Q, 5’). Then the set 
function 
p(A) = sup{m(F) + 12(A\E): Be 3m A} 

is a finite signed measure on ). 

Prove Lemma 6.26. Let 11,1) € G,(Q, 5’). Then the set 
function 

o(A) = inf{m(F) +12(A\E): Be 2 A} 

is a finite signed measure on 3). 


Let v, p € G,(Q, 2’). Show that v + p € G,(O, »’). Is it true 
that 


v+p=(v+p)°—(vtp) 


=vt+pt—-yvy—p? 


Let v € G(Q, 2’) and let a € R. Show that av € G(Q, X). Is 
it true that av = (av)* — (av) = avt — av~? Will there be 
any changes in the Hahn class? 


Show that G,(Q, 2’) is a vector space over the field R. 
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NEW TERMS: 


signed measure, 434 

positive measure, 434 

6(Q, »’), set of all signed measures on (Q, 2’), 434 

finite and infinite signed measures, 435 

o-finite signed measure, 435 

finite (v-finite) set, 435 

positive (v-positive) set, 435 

negative (v-negative) set, 435 

3B, N, families of positive and negative sets, 435 

continuous from below set functions, 435 

continuous from above set functions, 435 

continuity from above at the empty set, 435 

()-continuity of a signed measure, 436 

signed Borel measure, 436 

signed Borel - Lebesgue - Stieltjes measure, 436 

Hahn decomposition, 439 

Hahn decomposition theorem, 439 

[H*, H~],, Hahn class modulo v, 440 

vt(-),v-(-), |v] =v* +v~, Jordan variations, 441 

positive and negative variations of a signed measure, 441 

total variation of a signed measure, 441 

Jordan decomposition of a signed measure, 442 

signed measure generated by the integral, 443 

characterization of Jordan variations, 444 

v-maximal and v-minimal sets, 446 

geometric measure, 448 

signed measures as a lattice, 448 

6,(Q, 3), set of all finite signed measures, 449 

(G,(Q, 2’), < ), vector lattice of finite signed measures, 449 

signed measures as a Riesz space, 450 

\|v|| = |v|(Q), norm of signed measure v, 451 

(G..(Q, 2), || - ||), normed vector space of finite signed measures, 
451 

((6,(Q, ©’), R, +, -, ~,]]- ||), Banach lattice of finite signed 
measures, 453 
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7. COMPLEX MEASURES AND INTEGRATION 


Our next embellishment of the space G,(Q, 2’) of finite signed 
measures is the space of all complex measure which plays a signifi- 
cant role in integration. 


7.1 Definitions. 


(i) Let (Q, X’) be a measurable space. A set function € on » is 
said to be a complex measure if: 


(a) € is valued in C. (Notice that being valued in C, € must 
not have infinite values and, therefore, of those signed or 
positive measures only finite ones can be qualified as 
complex measures. ) 


(0) &(O) = 0. 


(c) € is o-additive. Analogous to the signed measures [see 
Remark 6.2(27)], equation 
| ECan An) | = | yy C(Ag) | 
(where |-| stands for the two-dimensional Euclidean 


norm), implies that the series U°° ,€(A,,) is absolutely 
convergent. 


The triple (Q, 2’, €) is referred to as a complex measure space. 


(it) Given a complex measure space (9, 57,&), the complex 
measure € is (explicitly) defined as € = 1, + 2v2, where 1, and 12 are 
finite signed measures on >’. Hahn decompositions should then be 
applied for v, and 1 and their corresponding Jordan decompositions 
will yield 


E=vi—v, +i(vy —vZ), (6.23) 


with vy, v;, vz, and vz being positive finite measures. We will 
call € in (6.23) the Jordan decomposition of the complex measure & 
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and the positive measures v;', 1; , v;, and v~ the Jordan varia- 
tions. O 


7.2 Remark. It should be noted that the nature of a complex 
measure is different from that of a signed measure. The key reason is 
that the real and imaginary parts of a complex measure are represent- 
ed by two arbitrary signed measures that carry out entirely different 
Hahn classes and Jordan variations. Bearing this in mind, we cannot 
expect the same characterizations between Jordan variations induced 
by Hahn decompositions and those in Proposition 6.21. For instance, 
if we define the total variation of € as |] =v +, +v7 +7, we 
cannot expect it to agree with the characterization 


Jel(A) = sup{2_,é(Be)| + {E,--, Ba} © Hand 
Riles A} (7.2) 


holding for signed measures. Both are likely to be disconnected and 
consequently, we utilize the latter (rather than the former) to re- 
present the total variation. So, we use (7.2) as the principal definition 
of the total variation of a complex measure €. O 


Having a more consistent definition of the total variation for € we 
need to validate that the total variation is a measure, which we would 
not have to, should the representation of the total variation be in the 
form |€| = vf +p tus +5. 


We begin with three introductory statements very similar to 
Propositions 6.7 - 6 9 for signed measures. We thus leave their proofs 
to the reader (Problems 7.3 - 7.5). 


7.3 Proposition. Any complex measure €& is continuous from 
below. 


7.4 Proposition. Let (0, ’,€) be a complex measure space and 
{A,,} be a monotone nonincreasing sequence of measurable sets. 
Then the signed measure € is continuous from above on { A, }. 


7.5 Proposition. Let € be a finitely additive set function on the 
measurable space (Q, 5’) such that &(@) = 0. € is continuous from 
below if and only if € is o-additive. 
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7.6 Proposition. The total variation of a complex measure & on 
(Q, 2’) is a finite positive measure on (Q, >’). 


Proof. First we notice that foran EF € », 
e(B)| = (BP + (2(B)P < |ra(B)| + 2B) 


< | |(Z) + |v2|(F). (7.6) 


Here and throughout other numerous instances, the reader should be 
cautious not to confuse the Euclidean norm |-| with the total 
variation |-|, because we use the same notation (running out of 
symbols). Now, let A € »’ and F),..., E,, be a measurable subparti- 
tion of A, meaning that £),..., £, (given some n € N) are disjoint 
and >", E; C A. Then from (7.6), 


ini l6(Z)| < doin |(E%) + ie lYel (FE) 


= || Ei) + [Pel oj Fi) < |i |(A) + |2(A). 


Obviously, the same inequality will hold for sup{}77_,|€(Ex)|} of 
all subpartitions of A and thus 


E(A) S |i |(A) + [v2 (A). (7.6a) 


In conclusion, the latter inequality is the assurance that the set 


function [57, R,, ||] defined so in (7.2) is reduced to [X’, R,, |&|], in 
as much as |v;| and |v| are finite positive measures. 


Obviously, |€|(@) = 0. Next we show that |£| is a content on »’ 
(i.e., finitely additive). Let A and B be disjoint measurable sets and 
let E,,..., E, be a measurable subpartition of A + B. In as much as 


E(E;) = €(E;9 A) + €(E;,9 A), 
we have 

|E€(Ei)| < |€(Ei.9 A)| + [EE 9 B)| 
and thus 
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wiail€(Ea)| < Via le(Ei 9 A)| + Ve le(Li 9 B)| 
(because £,0A,...,E,NA and £,O8B,...,E, 1B are meas- 
urable subpartitions of A and B) 


< |€|(A4) + [€|(B). 
The latter inequality holds true also for sup{}>,_,|€(Ex)|} of all 
measurable subpartitions of A + B. Therefore, 


JE|(A + B) < |€|(A) + [€](B). 
Now, we show the inverse inequality. Firstly we notice that if 

E,,..., E, is a measurable subpartition of A + B, 

Fy: = By 1) Ayes, Fa =f 1A 
and 

Psa i EF, esrerere a a E, NB 
are measurable subpartitions of A and B, respectively, and thus 
F,,..., Fo, is a subpartition of A + B. Hence, 

ye le()I < [el(A +B) 


or in the form of 


lA) + VienlE(A)| < l6l(4 + B) 
implying that 


EI(A) + [E1(B) < |8(A + B) 
and consequently, that |¢| is a content on ’. 


Finally, due to (7.3a), |€| is continuous at the empty set and thus, 
its c-additivity is due to Proposition 1.7(2i), Chapter 5. O 


7.7 Remark. Although the set G(Q, >’) of all signed measures is 
not a vector space (the sum of two signed measures need not be a 
signed measure, as we can atrive at oo — oo), the space C(Q, 2’) of 
all complex measures (over field C) is. To ensure that @(Q, >’) is a 
vector space we proceed in the same way as for G,.(Q, »’), namely 
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that C(Q, X’) is closed relative to linear combinations of complex 
measures with complex numbers. Next, the zero measure @ needs to 
be included in @(Q, »’) as well. Furthermore, it can be shown that 
\|S|| defined as |€|(Q) is a norm on @(Q, X’) (see Problem 7.10) and it 
therefore structures @(Q, >’) as a normed vector space. Finally, 
Theorem 7.8 below (see Problem 7.13) claims that (@(Q, 3), || - ||) is 
even a Banach space. O 


7.8 Theorem. The normed vector space (C(Q, »’),||- ||) is 
Banach. O 


The following is an embellishment of the integral notion of real- 
and complex-valued functions with respect to signed and complex 
measures. 


7.9 Definitions. 


(2) The integral of a complex-valued function with respect to 
a positive measure. Let [Q,C, f = u-+iv] be a complex-valued 
function. Given a o-algebra »’ in 2, f is measurable, if for every 
Borel set B € B(R?), f*(B) € X, as usual. By using the projection 
operators one can show that f is 7-B(R?) measurable if and only if 
u and v are 5/-B(IR) measurable. (More about it in Section 9 on 
product measures.) Now, given a positive measure pp € IN(Q, d’), 
we say that fe L1(Q,2,u;C) or f is p-integrable if |f| € 
IO, 2) 63 Ry )s 


As |f| < Ju] + |v] < 2|f|, f is integrable if and only if both u 
and v are elements of L1(Q, 2’, ju; R) and in this case we will write 


Jfdw= fudu+ifodp. (7.9) 


Therefore, L1(Q, X’, 1; C) is a linear space with the integral being a 
linear functional on L£1(Q, 2’, 1; C). All major theorems of integra- 
tion (cf. Sections | and 2) hold true with very minor notational modi- 
fications. 


(72) The integral of an extended real-valued function with 
respect to a signed measure. Let v € G(Q, >’) with its Jordan 
decomposition v = v* — v~. Denote 
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DO, 5,v;R) = L1(2,5,vt;R)NL'(0,2,v-;R). — (7.9a) 


The integral of a function f € C~'(Q, ©;IR) relative to the signed 
measure v is then defined as 


{fdv = [fdvt — ffdv-. (7.9b) 


The function f is said to be integrable with respect to the signed 
measure v, if f € L'(Q, X’,v;R). The value of the integral [ fdv is 
then finite. (For further discussion, see Problem 7.15.) 


Notice that the full decomposition of (7.9b) is 


ffdv = [ftdvt + ff dv — [f-dvt — fftdv 
(7.9c) 


and thus the space L'(Q, X’,v;IR) could be enlarged by including 
those f's whose one of the two parts, “positive” [ftdv* + [f-dv 

or “negative” [ f~dv* + [f*dv~ in (7.9c) is finite. In this case, we 
say that the integral [fdv of f exists and we write f € 
(9, 37,7; R). 


(722) The integral of an extended real-valued function with 
respect to a complex measure. Let € =1, +71) be a complex 
measure on a measurable space (Q, »’). Denote 


L'(Q, 238, R) = 10, 231, R)NL'(Q, 23%,R) — (7.9d) 


bearing in mind (7.9a). Then, define the integral of an extended real- 


valued integrable function f € L1(Q, »; €, IR) as 


{fdé = f fdr +if fdr. (7.9e) 


Unlike the previous case (77), we do not attempt to enlarge the class 
of integrable functions for the simple reason that { fd£ must also re- 
present a complex measure and thus it must be finite. 
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(iv) The integral of a complex-valued function with respect 
to a complex measure. Let € = 1 + i) € @(Q, »’) and let f € 
C-1(Q, 2;C) such that f =u+iv and u,v e L1(0, 2;€,R) of 
(7.9d). Then, we say that f is integrable with respect to the complex 
measure € and write f € L'(Q, ©';€,C). Then, the integral of f is 
defined as 


Jfdé = fudy — fvdr, +i(fvdy + fudrvr), (7.9f) 
O 


7.10 Example. Replacing f with {1,4 we see that (7.9f) also 
defines a complex measure and therefore € +> /f fd€ is a linear 
operator from @C(Q, »’) to C(Q, 2’). 


Now, let C7'(Q,X7;C) denote the linear space of all ¥’-meas- 
urable complex-valued bounded functions. Let € = 1, +i be a 
complex measure on (2, »’). The integral in (7.9f), as a functional of 
f, 1s clearly linear. 


We define a norm on C;!(, 37; C) as 
IIflloo = supt{|f(w)| sw € Q}. 
Given a € € @(Q), 3’), and s being a simple function, 
8 =D ipaite1 ay, 
we have 
| fsdg| = [dopa an€(An)| 
< pa te lE(An)| S Dop-rllsll. (Aw)| 


< |Isll. Ill. 


If f €C,'(Q,5;R,), then there is a sequence {s,} of simple 
functions with s,, 7 f. Hence, 


[snd] < [Snlho EI S WFlloo III: 
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which leads to 


[SFE] < lI fll ll€ll < 20 


and obviously, to the same inequality for f € C71(Q, X;C). The 


latter implies that C,1(Q, ©';C) C £1(Q, ©; €,C). O 
PROBLEMS 
7.1 Show that the total variation of a complex measure € has the 


Ta 


13 


7.4 


Ts 


7.6 


Tad 


property |€) <v>+u> +f +v~ = |™%| + |r|, and that in 
general, || << v> +; +v 7 +v~. Thus, it disagrees with the 
associated property for a signed measure. 


Prove the statement: Let (Q,2’,€) be a complex measure 
space. If A and B are measurable sets such that A € B, then 


&(B\A) = &(B) — (A). 


Prove Proposition 7.3. Any complex measure € is continuous 
from below. 


Prove Proposition 7.4. Let (Q,2°,€) be a complex measure 
space and {A,,} be a monotone nonincreasing sequence of 
measurable sets. Then the signed measure € is continuous from 
above on {Ay}. 


Prove Proposition 7.5. Let € be a finitely additive set function 
on the measurable space (Q, 5) such that &(@)=0. € is 
continuous from below if and only if € is o-additive. 


Show that for any complex measure €, |€(A)| < |&|(A), for all 
Aes, 


Any complex measure € € @(Q, 2’) is finite. It means that for 
any measurable set A, |€(A)| < oo. (a) Does it mean that € is 
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7.11 


7.12 
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7.14 


7.15 
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bounded? (b) Let € =v, +i%, where 14,12 € G,(Q, »). 
Show that for any A € 37, |€(A)| < |||] + ||vo]]. 


Prove that for any complex measure € = 1, + iv», ||E|| < ||v1]| 
+ |[vell- 


Let €,€1,€ € C(O, ©’) such that for each A € ¥, |€(A)| < 
|f(A)] + |&2(A)]. Prove that |[€]| < [|x|] + llé2ll- 


Show that the real-valued function || - || introduced in Remark 
7.7 is a norm on @(Q), 3’). Namely, let (Q, 2’, €) be a complex 
measure space and let ||€|| : = |€|((2) be a real-valued function 
defined on @(Q, X’). Then, || - || is a norm. 


Let € = v + 7p be a complex measure on a measurable space 
(Q, 5’). Prove the following inequalities for total variations 
and norms. 


6] = Jy (l4l + lal) and [él > Je (lvl + llall). 


Let {€. = Um +%pn} be a sequence of complex measures on a 
measurable space ((2, 5’). Show that {&,} is a Cauchy 
sequence if and only if {v,,} and {p,,} are Cauchy sequences. 


Prove Theorem 7.8 that (@(Q, 57), || - ||) is a Banach space. 


Show that for each f € L1(Q,L,u;C), |ffdul < f|fldu, 
where ju is a positive measure. 


Show that, given a signed measure v, it holds true that 
L'(Q, 2,v;R) = L'(0, Z, |v|;R). 


Modify the Lebesgue dominated convergence theorem of 
Section 2, Chapter 6, and prove it for complex-valued func- 
tions. 
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TAT 


7.18 


7.19 


7.20 


7.21 


7.22 


7.23 


7.24 


Let (Q, 7,44) be a measure space, with ju being a positive 
measure. Prove that for each fe L1(0,2,yu;C), |€| = 


J\fldu, where € = f fdy. 


Let (Q, X’, 1) be a positive measure space and let f € C~'(C). 
Show that f fdj = (0,0) for each Ac S if and only if 
A 


f = (0,0) p-a.e. on 2. 


Let (Q,27,v) be a signed measure space and f € 
(Q, 3v,R,). Define p= [ fdvt — [fdv-, p, = ffdv*, 
and p2 = | fdv~. Prove that p; and py are the Jordan varia- 
tions of p. 


Let (Q,7,v) be a signed measure space and f € 
(Q, X';v,R,). Prove that f =0 v-ae. if and only if f fdv 
A 


= Oforall Ac ». 


Let (Q,7,v) be a signed measure space and f € 
L'(Q, ©;v,R). Prove that f = 0 v-ae. if and only if [fdv 
A 


= Oforall Ac ». 


Let (Q, 3’, 1”) be a signed measure space and r € R (a real con- 
stant). Show that rv is a signed measure on >’ and prove that 
L(Q, 2;v,R) = L'(Q, 2;rv,R) and that ffd(rv) = 
rf fdv. 


Prove the monotone convergence theorem for a monotone 
nondecreasing sequence of nonnegative measurable functions 
{fn} on a signed measure space pointwise convergent to 
f €L(Q, X;v,R,), namely, 


lifters | far —[ fdv. 


Let (Q, »’) be a measurable space and v and pbe two signed 
measures on (2, ’). Suppose v + p is defined. Show that it is 
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7.28 


7.29 
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a signed measure on ((2,57) and prove that for any 
fe LQ, Z;v,R) 0 L(Q, 2; ,R), 


ffdw+p) = ffdv + f fdp. 


Let (Q, »’) be a measurable space and v and pbe two signed 
measures on ((2, 2’) such that v + p is defined. Suppose a and 
b are two real constants. (7) Is the linear combination av + bp 
always defined? (iz) If av + bp is defined, show that it is a 
signed measure on (9,2’) and prove that for any f € 
L(Q, 2} V, R) M L(Q, D5 P; R), 


{fd(av + bp) = af fdv + bf fdp. 


Let (Q,4’,€) be a complex measure space and cE C (a 
complex number). Show that c€ is a complex measure on 
(Q, 2’) and prove that £1(Q, 17; €,C) = L1(Q, 7; c€,C) and 
that f fd(c§) =cf fd€. 

Let (Q2, 4’) be a measurable space and € and 7 be two complex 
measures on (2, 57). Show that € + 7 is a complex measure on 
(Q,2) and prove that for any f € L1(0,;€,R)N 
D'(Q, 3, R), 


Sfd(€+n) = [fdé+ ffdn. 


Modify Problem 7.26 under the assumption that f € 
LQ, 25 €,C) 1 L1(Q, 23m, C). 


Let €,7 € @(Q, 3’) and a, @ be two complex numbers. Prove 
that for any f € L1(Q, 1; €,C) N L1(Q, 2, C), 
Jfd(ag + Bn) = af fdé + Bf fdn. 


Let (2,27,v) be a signed measure space and f € 
L'(Q, ©';v,C). Prove that f = 0 v-ae. if and only if ffdv 
A 


= Oforall Ac »’. 
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7.31 Let (Q,2°,€) be a complex measure space and f € 


L'(Q, ©';v,R). Prove that f = 0 €-ae. if and only if [ fdé 
A 
= 0 forall Ac». 


7.32 Let vy € G,(O, X’). Show that f € L'(Q, ©';v,R) if and only 
if |f| ¢ £1(Q, Y;v, Ry). 


7.33 Show that (£1(Q, Y;v,R), +, -, X ) is a vector lattice over 
R. 


wz 


7.34 Let € € C(O, »’) (be a complex measure). Show that a function 
f € L'(O, X;€,R) if and only if |f| ¢ £1(Q, ©; €,R,). Also, 
define L1(0, 1; €,R 


za 
SS” 


7.35 Let € € @(Q, »’) (be a complex measure). Show that a function 
f €L'(O, ;€,C) if and only if |f| € £1(Q, ©; €, R+). 


za 
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complex measure, 458 

complex measure space, 458 

Jordan decomposition of a complex measure, 458 

total variation of a complex measure, 459 

measurable subpartition of a set, 460 

@(Q, 2’), vector space of all complex measures over field C, 462 

\|€||, norm of a complex measure C(2, »’), 462 

(@(Q, >’), || - ||), as a Banach space, 462 

measurability of a complex-valued function, 462 

integral of a complex-valued functions with respect to a positive 
measure, 462 

integrability of a complex-valued function, 462 

integral of an extended real-valued function with respect to a signed 
measure, 463 

space L(Q, X’,v;R), 463 

space L1(0, X; €,R), 463 

integral of an extended real-valued function with respect to a 
complex measure, 463 

integral of a complex-valued function with respect to a complex 
measure, 464 

Lebesgue dominated convergence theorem for complex-valued 
functions, 466 

monotone convergence theorem on a signed measure space, 467 
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8. RADON - NIKODYM THEOREM FOR SIGNED AND 
COMPLEX MEASURES AND ITS APPLICATIONS 


We continue our discussion on absolute continuity started in Section 
5 for positive measures. In this section we present a more rigorous 
version of the Radon - Nikodym theorem, along with detailed proofs 
and some common applications to probability theory, namely, the 
conditional expectation. 


8.1 Definition and Notation. Absolute continuity of signed 
measures is formulated in the same way as that of positive measures. 
Let js and € be positive and signed or complex measures, respective- 
ly, on a measure space (Q2, 5’). We call measure € absolutely 
continuous with respect to measure js (in notation, v < yj) if every 
measurable ji-null set A is also a €-null set. A signed or complex 
measure that is absolutely continuous with respect to the Lebesgue 
measure is called just continuous. 


A signed or complex measure € is said to be absolutely 
continuous with respect to another signed or complex measure 7, in 
notation € <n, if€ < |n|. O 


We start this section with some easy-to-prove preliminaries. 


8.2 Proposition. Let v be a signed measure and 1 be a positive 
measure on a measurable space (Q, 5’). Then the following 
statements hold: 


@) veep Sv <pand vo <p> |r| <p. 
(ii) vi<pandvyo <p S>v<yp and |p| <p. 
(a) p<pee <p and y- < pS v <a. 


8.3 Proposition. Let € = v + 1p be a complex measure and 1 be 
a positive measure on a measurable space (Q, 57). Then, the 
following statements are equivalent. 


(ij) €<p. 


(ii) |El<yp. 
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(iii) UT <p, vu <p, pp? KM, pK pv<p, pK po 
(See Problems 8.1 and 8.2.) O 


8.4 Notation. Recall that G(Q, »’) denotes the set of all signed 
measures on (2, %’) and St(Q, X’) is the subset of all positive 
measures. Given ps € Wt, denote Gi = {vy € G(Q, VY): uv < pf. 
Define on the space L(Q, 2’, y; R) the map J, _ Such that for each 


ge 4(Q, D1, p45 R), 


J.9 (= Jody) =v (8.4) 


that, according to Problem 6.1, is valued in G and, as easily seen, 
v < yt. Consequently, [L(Q, 2’, u;R) ,6< i. ]isanintomap. O 


pe? 


From Definitions and Remarks 5.7(2i7), Chapter 5, we recollect 
that the pi-almost everywhere property of equality of measurable 
functions penetates the equivalence kernel modulo E,, (or just 
modulo 1) on C~!(Q, ©;1R) and thus on its subset L(Q, », yu; R). 
Consequently, 


AQ, Ji Lb; R) 


E, — AQ, Diy pi R)|,, 
is also a quotient set. On the other hand, by Corollary 1.20, Chapter 
6, the map J, _ “agrees” with E,,. In other words, i, _ adopts E,, as its 
equivalence kernel. Then, by Theorem 4.4, Chapter 1, there is a 
unique reducer of J, _» denoted by 


[ AGL, 2. fi5 R Een ed, k 


such that 


i = f,°o7E, 


where 7g is the projection of L(Q, X’,y;R) on its quotient 
(Q, 57, 4; IR)|,, by E,. Therefore, J, _ turns to an injective map oi 
with the domain L(Q, 2’, 4; R)| 


we 
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K 


Recall that any function g from the quotient class [g],, of 
functions [generating the signed measure v in (8.1)] is referred to as a 
Radon - Nikodym density of (the signed measure) v relative to (the 
positive measure) 1. 


As in Section 5 for positive measures, the chief factor of the 
Radon-Nikodym theorem below is _ that, eB) _ minor 
restriction imposed on jy, the mapping [L(Q, 2’, ;R lee, Si ] is 


surjective. In other words, for each measure v € Gi, there is an 
equivalence class [g],, of Radon - Nikodym densities of v relative to 
[l. 

8.5 Theorem (Radon -Nikodym). Let pw € IN(Q, X’) be a o- 
finite measure. Then [L(Q, ©, 1; R)|,,, Gis, Sl is a bijective map. 


Proof. The proof of the theorem includes three objectives. 
1) Show that given [g],, € L(Q, ©, 4; R)|,, Sulg In € GS, that 


b? 

is, for each g € [g],,, v = {gdp defines a signed measure, absolutely 
continuous with respect to jz. This is done. (Because g € L, v is a 
signed measure as per Problem 6.1. The proof that v < yp is trivial.) 
Therefore, 


i AQ, J, fy; R)| = oF 


is an into map. 


2) Show that 


Je 4(Q, Di, pli RQ). sis 


is a surjective (an onto) map, that is, for every signed measure v € 


Cr which is absolutely continuous with respect to a positive o- 


finite measure ju, there is an equivalent class [g],, € L(Q, 2’, 4u; R)| 


bt 
of Radon - Nikodym densities of v relative to js. This is the key part 
of the theorem and it is referred to as the “existence of a Radon- 
Nikodym density.” 


3) Show that the map 
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Sui AQ, 37, 1; R)| =? Ga 


is injective (one-to-one); that is, the above equivalent class [g],, is 
unique. This is due to Corollary 1.20. 


It therefore remains to prove the existence of the Radon- 
Nikodym density, namely, given a signed measure v, absolutely con- 
tinuous relative to a o-finite positive measure jz, there is an 
equivalence class [g],, of L-functions such that for each element g of 
this class v = [ gdj. We break up the proof of the existence into five 
parts starting with the case of two finite measures js and v and 
embellishing it to the case when ys and v are o-finite positive and 
signed measures, respectively. 


Case 1: yz and v are two finite positive measures 


Abbreviate Li, = L'(Q, X, w;R,) and introduce the subset of 
functions 


o={yerh [fds nayvacs}. (8.5) 
A 


Because 0 € @, the latter is not empty. Furthermore, @ is closed 
under finite suprema. Indeed, let f, g © ® and A € 3’. Denote by 


B= {we A: fw) > gw} 
and 

G= {we A: f&) < gw)}. 
Then, # + G = A and 


SV gdu = ffdut fgdu < WE) +u(G) = v(A). 
A E G 


Now, let 


SoS un ffduf € o| < v(Q) < . 
Q 
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Then, there is a sequence {y,,} C © such that sup{ fy, du} = S. 
Indeed, because S < co, for each n = 1,2,..., there is a function 
(Yn such that S' — ~ < fyndu < S. By setting f,, =) i we form 
the new sequence {f,,}, which is monotone nondecreasing and has S 
as limn—soof fd. By the monotone convergence theorem we have 
that g: = limp—ofn, in the topology of pointwise convergence, and 
is an L} -function that obviously belongs to ®. The function g we just 
arrived at is an {-maximal element of and it is an element of the 
equivalent class [g],, of {-maximal elements. (For a pertinent discus- 
sion see Problem 8.11.) 


Now, we show that [g],, is an equivalence class of Radon - 
Nikodym densities of v relative to jy, that is, for each g € [g],, 


fgdp = (A), forall Ae Z. 
A 


Because v < ys and because for all A € »’ (from the definition of &) 


Sgdu < (A), 
A 

the set function 
p=v— fgdu 


is a finite positive measure, absolutely continuous relative to meas- 
ure jt. If g is not a Radon - Nikodym density of v with respect to p, 
then p #0 and thus p(Q) > 0. The latter implies that, for some 
positive e, 


pu(Q) — ep(Q) < 0. (8.5a) 
Consider the (finite) signed measure y = ~p — ep. By Theorem 6.12, 


there is a Hahn decomposition (P, N) of 2 such that y(A NM P) > 0 
and 7(A M N) < 0 for all measurable sets A. In other words, 


u(PMA)—ep(PNA)>0 (8.5b) 
and 
(NM A) —ep(N NA) <0. (8.5c) 
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If (NV) = 0 then, because of p < pt, pV) = 0 and thus y(V) = 0. 
On the other hand, from (8.5b), by setting A = 2 we have that 


w(P) — ep(P) = y(P) 2 0. (8.5d) 


Furthermore, because by the above assumption about p, N turns out 
to be a y-null set, it follows from (8.5a) that y(P) <0. This 
contradicts inequality (8.5d). Hence, (NV) must be positive. 


Now we have from (8.5c) that 


feivdy = 2 WANN) 
A 


< ANA) < (A) = (A) — figdu 
A 
or, equivalently, 


f (E4n + gdp < (A). 
A 


Thus, the function + 1y + g € ®. But, because j(N) > 0, it holds 
true that 


f(44n + 9)du> fodu. 


This contradicts that g is an [-maximal element of &. Evidently, the 
contradiction is due to the wrong assumption about p. Thus p = 0, 
or, in other words, 


V(A) = fgdu 
A 


for all A € 3’, which proves the statement of the theorem for this 
special case. 


Notice that because v is finite and thus every Radon - Nikodym 
density g is an L1-function, by Proposition 1.21, Chapter 6, g is finite 
ji-a.e. If it is “occasionally” infinite, we can redefine g to make it 
finite. Therefore, in the equivalence class [g],, of Radon - Nikodym 
densities there is a subclass of finite densities. 
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To summarize case 1, given two finite positive measures js and v 
such that v < js, there is a unique (nonempty) equivalence class 
[gu € £(O, ©, u;R,)|,, of Radon - Nikodym densities (of measure 
vy relative to measure jy), with a nonempty subclass of finite 
densities. 


Case 2. wz and v are finite and o-finite positive measures, 
respectively 


If v is o-finite then there is at most a countable decomposition of 
(= 22,0, such that o(0,,) < co for all = 1,2,..: «Let 


Vn = Resyno,v. 


Then v,, is a finite measure on (2,,% 5’ and from case | it follows 
that there is a measurable nonnegative function g,: 2,, — R such 
that 


DANO) = | Ged, foreach Ae 2, 2H 1,2. 4... 
ANO, 


Now by the monotone convergence theorem applied to the sequence 
{X""_, dn1o, } we have that 


WA) = Vint Yn(A 1 On) 


=e oS ondu = JOP gnlo,du. 
ANQ,, A 


It only remains to set g = U2, gn1q, to complete this part of the 
theorem. 


Therefore, given two positive measures j1 and v such that 1 is 
finite, v is o-finite and v < py, there is a unique equivalence class 
[9], € L(Q, ©, w;R+) |, of Radon-Nikodym densities (whose 
integral is not necessarily finite). 


Case 3. yw is a finite positive and v is an arbitrary positive 
measure 


Denote by 
I ={B eS: Resynpy is o-finite}. 
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In as much as @ € I’, it follows that 4 ©. Let 
S = sup{u(B): B € I} 


and let {E,,} C I such that u(E,) - S.(0S < p(Q) < co, for each 


nm=1,2,... « “Therefore, there ts a set &, such ‘that 
S—1< WE,) < S.) Clearly, 
E:=UEneT. 
n=1 


Hence, S > pw(E) > w(E,) - S and n(Z)= S. 


Now because v is o-finite on FE’, from case 2, it follows that 


there is a ©’ M E-B,-measurable L'-function [E, R,, 9], such that 


ve(A) = (ANB) = fun Gu 


forall A € »’. Fix an A ec ». 


a) Let (ANE) > 0. If (AN ES) < co, then ANES ET, 
and thus F U (An E*) € I. The latter yields that 


S > wE + (AN E%)) = WE) + WAN B) > WE) 


and this contradicts w(E) = S. Thus (AN E*) = co. 


b) Let u(AN E*°) = 0. Then because v < p, it holds true that 
WAN E*) = 0. 


The above cases a) and b) can be combined in one compact 
equation 


vV(A M EY’) = [oo Lange dis 
by agreeing that oo - 0 = 0. Furthermore, 


V(A) = VAN F)+uU AN EY) 


= fang G du + foo Lane du = fy gdp, 
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where g = 9 1g + 0c01¢-. Notice that g is measurable, because 
ol TE) Ue We) ey. 
Therefore, given two positive measures j1 and v such that ju is 


finite, v is arbitrary, and v < ju, there is unique equivalence class 
[a], € L(Q, ©’, u; R+) | ,, of Radon - Nikodym densities. 


Case 4. yz is a o-finite and v is an arbitrary positive measure. 


Let Q= UP50, such that u(Q,) < oo for all n > 1. Due to 
Case 3, for each n, there is a 1 0,,-B,-measurable function [Q,, 
R, G,,], such that 


vV(A a) Qn) — aces Gy; djs, 
for all A € 3X’. Denoting g, = G19, we have 


VAN On) = f gn de 
A 


and thus 
v(A) = J gdp, 
A 


where, by the monotone convergence theorem, g = UPS 1 gn. 


Therefore, given two positive measures ju and v of which ju 1s o- 
finite and v is arbitrary and v < py, there is a unique equivalence 
class [g],, € L(Q, Y, pu; R)|,, of (nonnegative) Radon -Nikodym 


densities. 


Case 5. ys is a o-finite positive measure and v is a signed 
measure 


Let v = v* —v~ be the Jordan decomposition of v, where, for 
instance, v” is supposed to be finite. By Case 4, there are functions 


[Q, Ry, gi] each belonging to their respective equivalence classes 
[gil + % = 1,2, such that 


v*(A) = f gidw and v~(A) = f godu, AE YL. 
A A 
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In as much as by our assumption v’~ is finite, go is j:-integrable and 
g2 < © pi-a.e. and g: = gi — go makes sense. This leads to 


(A) = v+(A)—v-(A) = f gd. 
A 
Thus, we can say that g belongs to an equivalence class 
[gl — [91], = [92 , = {1 92° Hn € [91] +92 S [92], } 


of Radon - Nikodym densities. 


In summary of Case 5, given a o-finite positive measure js and 
signed measure v, with vy < ju, there is a unique equivalence class 
[a], € L(Q, Y, 4; R)| ,, of Radon - Nikodym densities of v relative to 


jl. 


Case 5a (special case of 5; v is a finite signed measure.) 


In this case, clearly, given a o-finite positive measure ys and a 
finite signed measure v, with vy < yu, there is a unique equivalence 
class [g],, € L1(Q, ©, u;R) | ,, of Radon-Nikodym densities of v 
relative to ju. 


Case 5b (special case of 5; v is a o-finite signed measure.) 


In as much as v is o-finite, there is a countable decomposition 
Ur On = Q such that 1, = Resspqq,v is finite for every n. By Case 
5a, because 


In K Ln = Res sno, He; 


there is a unique equivalence class [9nlu € 
D(Qn, 2 AQn, Mn; R) |, of Radon-Nikodym densities of v;, 
relative to 1. Now, letting 


g = So ees 


for every gn © [Gn]p,,, we define the class [g],, of Radon - Nikodym 
densities of v relative to 4. As a sum of countably many integrable 
functions, g is clearly ju-a.e. finite. 


The proof of the theorem is now complete. O 
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8.6 Definition and Notation. By the Radon - Nikodym theorem, 
the map a is therefore invertible and its inverse Si is also a map 
valued in L(Q, ©’, 4; R)|,,. In other words, for any v € Gj, under 
i ie there is a nonempty equivalence class [g],, of Radon - Nikodym 


densities of v relative to 44. We denote 


i by the symbol 4 a 


and for a fixed v € GS 


ny We Set 


ie '(V) = ii 


and call it the Radon - Nikodym derivative of measure v relative to 
measure [t. We emphasize that a Radon - Nikodym derivative is not 
the same as a Radon - Nikodym density, but is an equivalence class 
of Radon - Nikodym densities. O 


The following is a useful proposition and at the same time, an 
application of the Radon - Nikodym theorem. 


8.7 Proposition. Let j: be a o-finite positive measure and v a 
signed measure on a measurable space (Q, 5’) such thatv < . 


Then there are Radon - Nikodym derivatives %,& |“ such that 


du? du? du 
dv|* _ at [av] _ ave 
du “du? | du ~ dy? 


In other words, given an f ev yp and its decomposition f = f* — f~ 


it holds that f* € ee and . € ars where vt and v~ are the 


Jordan variations of v. 


Proof. Because v < ju, by me Radon - Nikodym theorem, there is 
a Radon - Nikodym derivative 4 da Which is an ss are class [/f],, 


of all Radon-Nikodym densities modulo yp. If f ew _, then, by 
Problem 6.18, the Jordan variations of v are 


w= (fo dpand v= [7 adp (8.7) 
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Equations (8.7) hold for any Radon - Nikodym densities f* and f~. 
Without causing any confusion we write them for the entire Radon - 
Nikodym classes (i.e., derivatives) and we do it on occasion. So, we 
have (8.7) in the form 


yee fe] du and y- = =fie] dus. (8.7a) 


On the other hand, by Proposition 8.2, v7 < yu and v~ < yw. Thus, 
by the Radon - oe theorem, there are Radon - Nikodym deriva- 


tives a and aT of v* and v~ with respect to jz. Therefore, for each 


gE ti and he #2 ~ we have 

vt = [gdp and v- = fhdu. (8.7b) 
Comparing (8.7) with (8.7b) we have 

fod = fftdp and fhdu= f[ f-dp, 


where the two equations obviously hold on 2’, i.e. for each A € »’. 
By Corollary 1.21 (since ju is o-finite), 


g=ft andh=f p-ae. 
In other words, g and f* and h and f~ belong to the same 


equivalence class, respectively, of Radon - Nikodym densities, since 
the latter is unique. Consequently, 


d dvt d d 
je] = 42 and fe) = 4 Oo 
8.8 Remarks. 


() Notice that for each f € . we have fte a and 


fieé€ ia However, although f = f* — f-, it is incorrect to write 


dv _ dvt _ dv— 
du du du 
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or in the form 


dv _ [av]? _ [av] 
du | du du}? 


because it may cause different elements (such as f* and g~) of the 
respective classes appear in one combination. This is why most often 
we use two different notations for derivatives and densities. 


(ii) Since v is a signed measure, one of its Jordan variations, 


say v~, is finite. Consequently, a C L. Thus, a is jl-a.e. finite on 


dv_ 
“dy 


Radon - eva la theorem, each Radon-Nikodym density of the 


class 4 ~ 1s p-a.e. finite and so are ae and nn : O 


(2, whereas need not to. If v is o-finite, fe by case 5b) of the 


After a short break filled with various intermediate results and 
notions we return to a version of Radon-Nikodym's theorem for 
complex measures. This is readily done as follows. Firstly, given a 
o-finite positive measure js € IN(Q, B’), we denote by 


Cr = {€ € CQ, LY): € < p}, 


where @(Q, X’) denoted the set of all complex measures on ’. Let 
EE ce and let €=1, +i. By Proposition 8.3, 1, << and 
V2 < ys and 1, and 1% are finite signed measures, thus according to 
Case 5a of the Radon- Nikodym Theorem, there are two equivalent 
classes [g;],, and [g2],, of Radon - Nikodym densities from the factor 
space [1(Q, 7, y; R)| ,, SO that, for any two elements g; and g» of 


dvy dv2 
du and ds? 


their respective classes 


(A) = fgidw and 1(A)= fgodu, foreach Ac 2X, 
A A 


thereby making [g],,: = a 1 x oT the targeted Radon - Nikodym 


derivative. The uniqueness of [g],, is based on that in Theorem 8.5. 
Summarizing the above arguments we have the following. 
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8.9 Theorem (Radon - Nikodym for complex measures). Let ju 
EM(Q,X) be a o-finite measure. Then [L'(Q, 2, p;C)| 


e< 


fe? 


fe? 


I | is a bijective map. O 


8.10 Theorem. Let i be a o-finite positive measure and v be a 
signed measure, both on the same measurable space ({2, 5/) such 
that v < ju. Let g be any function from L'(Q, 7;v,R). Then, there 


V 


is a ae Radon - Nikodym derivative class a such that for any 
ean 


Jgdv = fofdu. 
Proof. 


(i) Let_v_be a positive measure. According to Case 4 of the 
Radon - Nikodym theorem, there is a nome sate Radon - Nikodym 


derivative class n such that for each f € ne v = | fd. We can 


also write it in the form v = Ja dv “dy. considering that the equation 
holds for each element of class 4 a 


In other words, v < js implies that for each measurable set A, 
A) = {fdu (or v(A) = fad) (8.10) 


holding for each f € am We consider four cases of function g begin- 


ning with an indicator function. 


(a) Let g = 14, for some A € »’. Then, by (8.10), 
=|lady = fodv= \lafde= |ofdp. 


(b) Let g be a nonnegative simple function, g = )>%",a;14,. 
Then, by (a), 


[ody = 3,0; [lady = Mai fla, fdy 


=| of: 
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(c) Let g € C;'. Then, there is a nondecreasing sequence {g,,} 
of nonnegative simple functions convergent to g pointwise on 1). 
Thus 


Jgdv = supf gndv = supf gn fdy 
then use the monotone convergence theorem for positive measures 
= fofdp. 
(d) Letg €C~'(R). Then, g = gt — g~ and due to (c), 
Jgdv = fgtdv— fg-dv = fg" fdu— fg fd 


=f(9' —g9 )fdu = fofdp. 


The equation holds true for each f € a Thus we can also rewrite 
this equation in the form 


Sgdv = fo%du. 


(ii) Let _v_be a signed measure, with the Jordan decomposition 


v=v*—v-. Then, by Proposition 8.7, each f € {% has the 


+ 
positive and negative components f* € FA = ae and f  € 


ae] = ie Thus, for a g € L(Q, Y;v,R), utilizing case (i) in 


the integral decomposition 
fgdv = [gdvt — fgdv- 


separately for [gdv* and [ gdv~, we collect two densities f* and 
f~ of v* and v~ with respect to jz as two portions of f. The latter, 
itself, is a Radon - Nikodym density of v with respect to ju. Thus, 


fodv = foftdu— fof-du = fo(ft — f-)du 


= fofdu. 
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The last equation holds for each density f € -, therefore, we can 
rewrite it in the form 


fodv = foiidp Oo 


8.11 Theorem. Let ji: be a o-finite positive measure and € be a 
complex measure, both on the same measurable space (Q, 5’) such 
that €<yp. Let g be any function from L*(Q,2;€,R) or 
iQ, Pe ;€,C). Then, there is ; age Radon - Nikodym derivative 
class 4 a © such that for any f € =, 


Jgdé = fgfdu. 


Proof. 


(i) Now, let € =v, +iv2 be a complex measure, absolutely 


continuous with respect to jz. Then, fora g € L'(O, ©; €,R), 
Jgd& = [gd + if gdrvr. 


Furthermore, by the Radon-Nikodym theorem for complex 
measures ee 8.9) we have a Radon-Nikodym density 


f=ut+i e& * such that w € oa and v € om Applying case (i7), 


we arrive at 
Sod = fgudu+if gud = fg(ut iv)du 
= Jofdu 
or in the form 
Sodé = fogdp. 


(ii) Let € be a complex measure such that €< yu and let 
g=91 + igo €C1(C). According to part (i) we have a Radon- 
Nikodym density f € oa x oe such that regardless of the choice of 


any real-valued measurable function h € L1(Q, X;v,R), 


[hd = fhfdp. 
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Thus, replacing h with g; or g2 we have 
S(g + ige)dé = fofdut+ifgofdu = fgfdu 


or in the form 
Jgdg = fod du O 


8.12 Theorem (The Chain Rule). Let j: and p be two o-finite 
positive measures on a measurable space (Q,+') and let € be a 
signed or complex measure on (Q, 5’). Suppose £ & u< p. Then 
the following equation holds for Radon - Nikodym derivative classes: 


d& _ d& du 
do du dp’ 


Proof. Suppose that <p such that jz and p are o-finite 
positive measures and h is a complex function from L1(Q, ©; 1, C). 


Then, by Theorem 8.10, there is a density function f € “e such that 
fhdu = fhfdp. (8.12) 
or in the form of 
hd = fhZdp. (8.12a) 


Now, if € is a complex measure such that € << p, then for each 
he a (8.12-8.12a) hold, and because now [hd = €, we have 


€= fhdw= fhfdp (8.12b) 
or in the form of 

€= fRdu = fa hee. (8.12c) 
On the other hand, by transitivity € < p, there is a yp e& * such that 


E= fdp (8.12d) 
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or in the form of 


€= f £dp. (8.12e) 
Comparing (8.12b-c) and (8.12d-e) we arrive at the equations 
fedp = Shfdp (8.12f) 
or 
d 4 d&é d | 
Sade = Sade (8.12g) 


written in terms of Radon-Nikodym derivative classes. The two 
integrals are integral measures that are equal on »' and p is o-finite. 
We remake Corollary 1.21 modifying it for complex-valued 
integrands y and hf when applying it separately for the real and 
imaginary parts, concluding that 


p=hf p-a.e. on. 


When equating the associated equivalence classes (i.e., Radon - 
Nikodym derivatives), we have the following 


dé _ dé dp 
dp du dp 
which is reminiscent of the chain rule in calculus. O 


Now, let us consider a series of applications of the Radon - 
Nikodym theorem in probability theory. 


8.13 Example. Let X be a random variable defined on a proba- 
bility space (Q, XY’, P). Suppose (R, B(R),PX*) is the probability 
space induced by X, where PX™ is the probability distribution of X. 
Given a o-finite positive Borel measure jz such that PX* < p, 
according to Case 4 of the Radon-Nikodym theorem, there is a 


nonempty equivalence class ie € L(Q, 2, uw; R+)| , of nonnega- 
tive Radon - Nikodym densities nik that for each g € ae it holds 
true that dPX* = gd. 
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In particular, if 4. = A is the Borel - Lebesgue measure, then the 
probability distribution PX*, which is assumed to be continuous 
(i.e., absolutely continuous with respect to A) can be represented by 
the Lebesgue integral [ gd, and the Radon-Nikodym density g is 
referred to as the probability density function (pdf). 


Furthermore, F'(x) = PX*((—0o, z]) is the probability distribu- 
tion function of X, which then can be represented by the integral 
J" .,g(u)A(du). In [Dsh2] we learn that by the Lebesgue theorem, 


F is d-a.e. differentiable in the usual sense and its Newton - Leibnitz 
derivative F” equals g \-a.e. O 


8.14 Remarks and Definition (of the Conditional Expecta- 
tion). 


(i) Another consequence of the Radon-Nikodym theorem, 
which is at the heart of stochastic analysis, is the notion of the 
conditional expectation. Let X € L(Q, 2’, P;R, B(R)) be a random 
variable and suppose y C »’ to be a sub-c-algebra. Denote Py = 
Resy,P and let v9 = { XdPo. Then, 1 is a signed measure on Xp, 


absolutely continuous with respect to Po. 


Thus, according to Case 5 of the Radon - Nikodym theorem, there 
exists a unique Radon-Nikodym derivative (equivalence) class 


oe € (Q, Xo, Po;R,B(R))|,, of random variables (Radon - 


Nikodym densities), referred to as the conditional expectation of 


random variable X given sub-o-algebra Sip, in notation, E[X |]. 


Any random variable X € ae is called a version of the conditional 


expectation. 


Now, as a Radon - Nikodym density of 1 with respect to Py, Xo 
is a part of the equation 1) = [{ XodPo holding true on y. But so is 
X through vp = { XdPo as well, originally defined on Xo. Thus, we 
have 


[XodPy = [XdPo, forall A € Xp. (8.14) 
A A 


In particular, with A = Q and Po, lawfully replaced with P on the 
right of (8.14), we have 
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EX) = EX. (8.14a) 


(ii) In the literature on probability, E[X|o] is often confused 
with the above Radon - Nikodym density Xo. For notational conve- 
nience, sometimes we comply with the commonly used notation, 
however bearing in mind that E[X|o] is a class, not an individual 
Radon -Nikodym density. For example, if we replace Xo with 
E[X |p] in equation (8.14a), the latter reads 


E[E[X|Zp]] = EX (8.14b) 


which is the well-known law of double expectation. 


(iiz) For an event (measurable set) A, let XY = 1,4. Then, the con- 
ditional expectation E[1,4| Xo] is called the conditional probability of 
event A given sub-o-algebra Sy, in notation P(A|o). 


(iv) If Y is a random variable on the same probability space 
(Q, 7,IP),then Y is measurable with respect to the o-algebra 
Xo: = Y**(B(R)) it generates in ©, and in order to be ¥- 
measurable, 4/9 must be a subset of »’. Now, the conditional 
expectation of X given this sub-c-algebra in probability theory is 
often denoted by E[X|Y]. O 


The following construction explains why E[X|o] is called the 
“conditional expectation.” 


8.15 Examples. 


(i) Let X be a random variable on a probability space 
(Q, ©’, P) and let H = {H, Ho,...} be a measurable partition of 2 
such that P(H,,) > 0 for each n € N. Then, 


By {2, @,A=S-H;: ren} (8.15) 
ied 


is a o-algebra generated by H. Clearly, X» C 2X’. Let Po = Resy,P 
and define the random variable Xp on the probability space 
(Q, 05 Po): 
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Xo = (eth fxa ») 17, (8.15a) 
For 

tine (rch, JX ») ie 
we have 


E|Xo1z,| = | Xod Po = [Xd = E[X1z,], k= eee Drees 
Aly, FI; 


Next, for an A € ¥’o, we calculate for 


Xol4 = Xolyz, => (echo, f x4 | Lu, 
H, 


ied neJ 


the expected value 


E[Xola] =o fXdPy = [XP = E[X14]. 
néJ H,, A 


Hence, 


19 = { XdPo = f XodPo on Xp 


and therefore, by the Radon-Nikodym theorem, Xo is a version of 
the conditional expectation E[X|Xo]. In fact, in this particular, case, 
this is the only version of E[X |]. This is because the o-algebra Xp 
is too crude and it does not contain any null sets other than the empty 
set. Suppose NV is a P-null set, not listed in 3p, and let 


X=X)+I1y. 


Then, obviously, X= X, P-a.s. However, X is not a version of 
E|X|Xo], because X is not Xop-measurable to begin with. 


(ii) Now, let us include N in 3p. To simplify the construction, 
assume that H = {Ho, Hi,...} is a measurable partition of 2 such 


492 CHAPTER 6. INTEGRATION IN ABSTRACT SPACES 


that Hp = N and thus P(Ho) = 0, whereas P(H,,) > 0,n € N. We 
modify 2’ in (8.15) as 


= {2, @,A=S-H;: rena} (8.15b) 
ied 


and define Xo exactly as in (8.15a). Now, obviously, 


E[Xo1y] = E[X1y] =0 


and therefore, again we can show that 


= { XdPo = { XodPo on dip. 


However, in this case X= Xo +1y is yp-measurable and X turns 
out to be yet another version of the conditional expectation E[X| Xo]. 


(iiz) In the context of the above constructions, let B and H be 
two events such that P(H)>0. We define the conditional 
expectation 


E[X|H] = Pun | XaP. (8.15c) 
In particular, for the random variable 1g we have 

E BoH 

E[tp|H] = “oa. (8.15d) 


The latter agrees with ie conditional probability P(B|H). Now, if in 
(8.15a) we replace ray 9 Fe with E[X|H,,] as per (8.15c), we 


can rewrite (8.15a) as 


Xo = ,E[X|Ap]|1 x, (8.15e) 


n=1 


Xo is the only version of the conditional expectation, thus we can 
also write 


E[X|Xo] = 72, E[X|An]12,. (8.15f) 
O 
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In closing on applications of the Radon-Nikodym theorem to 
probability we offer several useful propositions that the reader can 
prove with relatively little effort. 


8.16 Proposition. Let X € 
variable, iy © ' a sub-o-algebra, and 
o-a.s. if X is 3ig-measurable. 


E[X |X| 


8.17 
X,Y eEL(Q, 


2, P; 


R, B( 


= Ress, 


P:R, B(R 


Po = Ress, 


. Then, for any two real numbers a and b, 


E[aX + bY|Xo] = 


aE[X|Xo] + 


bE[Y |] 


P- 


a.s. 


)) be a random 
P. Then, X = 


Proposition (linearity of the conditional expectation). Let 
R)), let Sy C SX’ be a sub-c-algebra, and 


8.18 Proposition (monotonicity of the conditional expecta- 


tion). Let X,Y € 


algebra, and 


(2, 5, P:R, B( 
Po = Ress,P. Then, fxsY 
E[X |X] < E|Y Xo| Po-a.s. 


P-a.s., 


R)), let Xo CX be a sub-o- 


8.19 Proposition (monotone convergence theorem for the con- 


ditional expectation). Let X,, Xo,... 


be a monotone nondecreas- 


ing sequence of nonnegative r.v.'s from L(Q, ©’, P;R,, B(R,)), let 
3p C 3! be a sub-o-algebra, and Py = Ress, P. Then 
limp socE[Xn|5o] = E[limysooXn|Xo] Po-a.s. 
8.20 Proposition. Let X,Y, and XY € (2, x, P;R, B(R)), let 


ig © & be a sub-o-algebra, and Po = Resy,P 
measurable, 
E[XY|Xo] = YE[X|Xo] Po-a.s. 


: Then, if Y is Sip- 


O 
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PROBLEMS 


8.1 


8.2 


8.3 


8.4 


8.5 


Prove Proposition 8.2. Let v be a signed measure and [4 - a 
positive measure on a measurable space (Q,). Then the 
following statements hold: 


4) vep Sv <pand vw <p > |p| < p. 
(it) vi <pandvo <p Sv<yp and |p| <p. 
(iit) \vol<psavt<pandvy <p svc. 


Prove Proposition 8.3. Let € =v-+ip be a complex measure 
and | a positive measure on a measurable space (Q, 5’). Then, 
the following statements are equivalent: 


(ij) <u. 
(ii) |El<p. 


(41) UT <M p,v <p, pt? <p, p <wv<Ky,p <u. 
Let (Q, 5’, €) be a complex measure space. Show that € < |€]. 


Let € € @(, »’). Show that the Radon -Nikodym derivative 


rin of € with respect to |€| exists and that the modulus of any 
dg 


Radon - Nikodym density from 77 equals 1 |E|-a.e., where 1 is 


the function defined as 1(w) = 1,w EQ. 


Consider in Case 1 of the Radon-Nikodym theorem, the 
partial order = on @ by defining f = g if and only if f < g 
ji-a.e. Show that any chain in @ has an upper bound and thus, 
by Zorn's Lemma 4.13, Chapter 1, @ has a maximal element. 
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8.6 


8.7 


8.8 


8.9 


8.10 


8.11 


8.12 


a € € C(O, X’). Show that the Radon-Nikodym derivative 
a exists and that for each g € L1(Q, X; €,C), 
ode = foqe ale. 


Let py and p be two o-finite signed measures on a measurable 
space (Q, »’) such that 4 & p (meaning ww < p and p < 1). 
Show that 


= 
A = (¢) ji- (Or p-) a.e. 


Show that C's is a linear space over the field C. Does the same 
apply to o<? 


Let (Q,2’,P) be a probability space under the following 
assumptions. Let Q = [0,1], © =6BN [0,1], and P = RessA 
(where A denotes the Borel- Lebesgue measure). Let H = 
{H,,...,H,} bea finite partition of Q such that 


Ay = [A oa k=1,. 
and let Xo be the smallest c-algebra generated by H. Let 
X(w) =w, for all we. Calculate a version Xo of the 
conditional expectation E[X |X] and give all other versions of 
E[X| Xp]. 


Prove Proposition 8.16. Let X € L(Q, Y,P;R,B(R)) be a 
random _ variable, dig C Xa ~sosub-o-algebra, and 
=Resy,P. Then, X =E|[X|X] Po-a.s. if X is So- 
measurable. 


Explain exactly why X 4 E[X|5o] if X is not Xo-measurable. 


Let X be a random variable on a probability space (Q, »’, P) 
such that X = 1 P-a.s. on 2. Let Xp be a sub-c-algebra of »7 
on 2 and Po = Resy,P. Show that X = E[X|Xo] Po-a.s. 
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8.13 


8.14 


8.15 


8.16 
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Prove Proposition 8.17. Let X,Y € L(Q, Y,P;R, B(R)), let 
2g © & be a sub-o-algebra, and Py = Resy,P. Then, for any 
two real numbers a and b, 


E[aX + bY |X] = aE[X|Xo] + bE[Y |X] Po-a.s. 


Prove Proposition 8.18. Let X,Y € L(Q, Y,P;R, B(R)), let 
dip C & be a sub-o-algebra, and Po = Ress,P. Then, if 
X<Y P-as., 


E[X |X] < E[Y |X| Po-a.s. 


Prove Proposition 8.19. Let X1,X9,... be a monotone non- 
decreasing sequence of nonnegative r.v.'s from 


(Q, 0,P;R,,B(R+)), let %9 C © be a sub-c-algebra, and 
Py = Resy,P. Then 


limp oc [Xn|Xo] = Ellimn—ocXn|X0] Po-a.s. 


Prove Proposition 8.20. Let X,Y,and XY €L(Q,2,P 
R, B(R)), let Xo C Y be a sub-c-algebra, and Py = Resy,P. 
Then, if Y is 3ig-measurable, 


E[XY|X] = YE[X|X] Po-as. 
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NEW TERMS: 


absolutely continuous signed or complex measure with respect to a 
positive measure, 471 

absolutely continuous signed or complex measure with respect to a 
signed or complex measure, 471 

Gr = {v € G(Q, X):v < py}, set of all signed measures 
continuous relative to ju, 472 

[L(Q, 27, u; R)I,,, OF, Sul bijective map between L functions and 
signed measures continuous relative to 4, 472 

Radon - Nikodym density of a signed measure, 473 

Radon - Nikodym theorem for a signed measure, 473 

Radon - Nikodym theorem for two finite positive measures, 474 

Radon - Nikodym theorem for 1 finite and v o-finite positive 
measures, 477 

Radon - Nikodym theorem for ju finite and v arbitrary positive 
measures, 477 

Radon - Nikodym theorem for js o-finite and v arbitrary positive 
measures, 479 

Radon - Nikodym theorem for 1 o-finite and v arbitrary signed 
measures, 479 

SY” = om Radon - Nikodym derivative of a signed measure, 481 


dv dvt dv— : : : : 
a Radon - Nikodym derivatives of signed measure v and 


Jordan variations with respect to ju, 481 

CS = {€ € C(Q, X’): € < pu}, set of all complex measures 
continuous relative to ju, 483 

Radon - Nikodym theorem for complex measures, 484 


[E027 780), CF 


ee 5, , bijective map between L! functions 
and signed measures continuous relative to ju, 484 

the chain rule, 487 

probability density and probability distribution functions, 489 

conditional expectation given a sub-c-algebra, 489 

the law of double expectation, 490 

conditional probability of an event given a sub-o-algebra, 490 

linearity and monotonicity of the conditional expectation, 493 

monotone convergence theorem for the conditional expectation, 493 
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9. MEASURES AND INTEGRATION 
ON FINITE PRODUCT SPACES 


The present section extends the results on integration to Cartesian 
products. It discusses the formation of product o-algebras (which has 
some resemblance to the product topology) and product measures on 
them. This leads to the main result of this section, the celebrated 
Fubini's theorem, which allows one to iterate multiple integrals as its 
measure-theoretic analogue of multiple Riemann integrals. 


Many textbooks in analysis and on the history of mathematics 
adopt “‘Fubini's theorem” as a generic name for a class of theorems 
establishing the identity of multiple integrals with iterated integrals. 
In the mainstream of the evolution of calculus, when integrating a 
function f on the rectangle R = [a,b] x [c,d], the question was 
raised: under what condition does the existence of the double integ- 
ral [ { fd(x, y) guarantee the existence of either of the iterated integ- 

R 


rals, f’{f f(a, ydy}da and f“{[? f(a, y)dx}dy, and will they all 
be equal? Fubini's theorem, in one of its earlier forms, was proved by 
Augustin-Louis Cauchy in the early nineteenth century and applied 
to continuous functions. In 1904, Henri Lebesgue extended this 
result to bounded measurable functions. In 1906 Beppo Levi conjec- 
tured that f need not be bounded, but just integrable. The Italian 
Guido Fubini (1879 — 1943) proved this statement in 1907. Namely, 
he proved that given the function f is integrable on R, the functions 
xt» f(x,y) and y+ f(a, y) are integrable for almost all x and y, 
respectively. In addition, the functions yt> i f(x,y)dx and 


Lhe fi f(x, y)dy are integrable and 
[J Fdw,y) = ode [EF @, vay} = [od [Fe yaa}. 


Fubini, however, imposed an unnecessary condition on the integrand 
function. This was corrected and refined independently by the Italian 
Leonida Tonelli (1885 — 1946) in 1909, Brit Erest W. Hobson 
(1856 — 1933), and Belgian Charles J.G.N. (Baron) de la Valée- 
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Poussin (1866 — 1962) in 1910 who rendered their proofs entirely 
differently from that of Fubini. 


The notion of multiple integrals goes back to as early as the 
middle of the eighteenth century, first in the form of an indefinite 
integral. Later on, by 1770, Leonard Euler formalized the double 
integral on a bounded domain and applied the above formula for 
iterated integrals by justifying it in terms of Riemann sums. 
Functions to be integrated were assumed to be continuous, and the 
area of integration was fairly basic. This approach began to 
encounter serious conflicts as soon as more general cases were 
considered. Not until Henri Lebesgue published his famous thesis in 
1902, did it become possible to tackle other classes of functions, 
which all led to Fubini's theorem of its 1910 version. More general 
versions of Fubini's theorem (which we explore in this section) in 
abstract spaces, applied to abstract measures and integrals, became 
accomplishable after the Johann Radon's extension of the Lebesgue 
integral in 1913 (mentioned in Section 5). 


We begin this section with the notion of “product o-algebra” 
which has many similarities with the formation of a product topology 
with finitely many factor spaces. As in the event of a product topol- 
ogy, we define the product o-algebra on the space of rectangles and 
show that it is identical to the o-algebra relative to which all projec- 
tion maps are measurable (continuous in the case of the topology). 


9.1 Definition. Let (Q;, 5’;), i = 1,...,n, be measurable spaces. 
Denote 


kop= {TAs Ave Zyi= 1. nh (9.1) 


i=1 


and call it the set of all measurable rectangles in Q: = [[ Q;. The 
i=l 


o-algebra on ( generated by x 5}; is called the product o-algebra 
and is denoted by 


5525 = (X51). (9.1a) 
f= i= 
Oo 


500 CHAPTER 6. INTEGRATION IN ABSTRACT SPACES 


9.2 Theorem. >’, is the smallest o-algebra on \) relative to 
which all projection maps are measurable, that is, for each 
i=1,...,n, 1; is Xig-d;-measurable. In other words, 


Ses (Unrz)). (9.2) 


Proof. (i) Let 21 be a o-algebra relative to which all projection 
maps are measurable. In other words, 7**(57;) C 21. We show that 


x 3); C YA. Indeed, if R is a measurable rectangle (i.e. R € x Xi); 


R=) 7;(A;) such that A; € ¥; and since 77(A;) € MI, R € A. 
i=1 


(ii) Because x 5); is a generator of 37g, we have ig C AL. 
i 


(iit) We show that 7; is ¥’,-’-measurable. Indeed, if A; € »;, 


* 


the measurable cylinder 7; 


(A;) € x »;, because it also is a 
= 


measurable rectangle. Thus, 7**(37;) C x di, © Lg. 


(iv) In conclusion, because ’g C 2, and since 2 is a c-algebra 
relative to which all projections are measurable, »’z is the smallest 


of all such o-algebras. On the other hand, »’ (U T “(z)) is yet 
i=1 


another smallest calgebra relative to which all projections are meas- 
urable. O 


Consequently, we say that ’, is generated by the projection 
maps. We recollect that the Borel c-algebra on R” was generated 
half-open n-dimensional intervals (called rectangles) and that the 
collection of all such rectangles was a semi-ring. It seems plausible 


that x 2’; is also a semi-ring. 
i=1 


9.3 Lemma. Let A, B € x 3);. Then, A\B can be partitioned 
with a finite union of pairwise disjoint measurable rectangles. 


Proof. (i) For n = 2, we have from Problem 5.9, Chapter 1, that 


9. Measures and Integration on Finite Product Spaces 501 
A\B = [(A1\B1) x Ay] + [(Ai a B,) x (A2\ Be)). 


According to the same problem, there is yet another, alternative, 
decomposition of A\ B: 


A\B = [A, x (Ao\Bo)] + [(A1\B1) x (A2N Ba)]. 


(it) For an arbitrary Mn, we proceed by induction. ee we 
assume that if A,Be x ,A\B= >> Ro” where R{” € 


x x Li. Now consider 


Sh = x YX De = 2 ee, 


i=1 


Lett A,BeEe SY x D?). Thus, A= A x A?) and B= BY 
x B®), By Case (i), 


A\B = [(AM\B®) x A@] + [(A® 1 B®) x (A®\B)] 


= [(T2R!”) x 4] + [RD x (4\B®)] 


(by Problem 5.6(i), Chapter 1) 


n n nt sn+1 n+1 
=z ae (Ri x A?) 7 a = Veal R : 


mr - Sear . . 
Now, because < 5}; is ™ -stable, it is a semi-ring. i 
= 


9.4 Remark. It seems plausible to define an elementary content 


yw? on x x; by setting w°(A) =[][;(A;) analogous to our 
i= i=l 


construction of the Lebesgue elementary content on R”. Using 
Lemma 2.5 and Theorem 2.6 of Chapter 4, we can extend the semi- 


ring x +}; to the smallest ring (in our case it is an algebra), 
— 


502 CHAPTER 6. INTEGRATION IN ABSTRACT SPACES 


A(X x3) of measurable elements (called figures), each one of 


which is either a rectangle or it can be partitioned in finitely many 
disjoint rectangles. Consequently, ju° can be extended to a content 7i 


to measure any figure F’ € A(X x] as, €:0. 17") = ¥_ oft Cty). 
We can go on in the direction of Caratheodory extension, making 
some additional assumptions. However, we will render the extension 
another way by specifying an integral measure (fully explicitly), 
which operates on X's, so that it equals .° on x pare O 


Until we get to that point, for the sequel, we establish some 
auxiliary results. The lemma below enables us to utilize a more eco- 
nomical generator of X’g, again with some resemblance to the 
formation of the product topology. Consider one more notation. Let 
G; be a subfamily of measurable sets from 47;. Let us denote by 


G= x Gi =; Xr XK Gi, 


the set of all measurable rectangles G; x --- x G, where G’;'s are 
picked from Gj. 


9.5 Lemma. Let G; be a generator of 3}; containing a sequence 
{Gi,: k =1,2,...} of sets, i=1,...,n, monotonically increasing 
to Q;. Then the product a-algebra ¥’, coincides with the o-algebra 


= x(, K Gi) (generated by G = ‘ K Gi). 


Proof. 


(2) Because G C Xa, it follows that 3’ C X'g. Indeed, every 
A; € G; is also an element of 5’; and therefore 17(A;) € 17(X;) C XY 
which implies that 


nm nm 
[[Ai = 177 (Ai) € Xe. 
i=1 i=1 

(ii) Now we show the inverse inclusion Xz C 3’. We prove 
that each 7; is »’-»’;-measurable. By our assumption, each generator 
of 3); contains a sequence {G;,,} ? ;. Consider 
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Gig X Gop X +++ X Gi-tk X Ai X Gitte X °° X Grp 
= G, € G, 
where A; € G;. Observe that 
Gy =7i(Gig) NN t(A;) NN (Gung). 


Therefore, 
sup, {Gy} = U Gi = m*(A;) Es» 
k=1 


(because we took the union of elements of 2’). Hence, we proved 
that the inverse image of an arbitrary element of G; (which is a 
generator of 47;) under 7; belongs to X’. According to Proposition 
3.4, Chapter 4, we claim that the same inclusion holds for an 
arbitrary element of >’; or that 7; is 4’-5/;-measurable. In as much as 


mT; 1S 3ig-57);-measurable for all 7 = 1,...,, it follows that X’ con- 
tains all measurable cylinders, that is, a generator of X’,. Thus »’ 
contains ¥’.. O 


9.6 Remark. Now we see that in light of Lemma 9.5, dig 
= 8. 57; 1s induced by a more “economical” generator than the one 
given in Definition 9.1, i.e. by all rectangles from »;'s. In some 
cases, when we fail to indicate a generator, we do consider 3’, as the 
o-algebra generated by all rectangles as per Lemma 9.5. O 


9.7 Examples. To tell sets and set collections in R from those in 
R” we will attach to the latter the superscript ”. 


(4) Let 0); =R, +, = B; = B (Borel o-algebra), i= 1,...,7. 
Then 2. = R” and Bg = & B;. We also know that there is another 


o-algebra in R", the Borel o-algebra B” = B(R"). What is the 
relationship between 6B” and Bz? Recall that 6” was generated by 
the semi-ring of n-dimensional half-open intervals, which, for 
convenience, we denote by S”. Observe that 

oO" =5 * ee eS. 


—~+__ 
n 
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a 


and that each S contains a sequence, monotonically increasing to 
Thus, by Lemma 9.5, B" = 2'(S,,) and Bz must coincide. 


(it) Recall that the Borel-Lebesgue measure \” on B” was 
emerged from the Lebesgue elementary content \", defined on S” 
as 


(am, b"}) = I — a) = TD Ao( (a b;]). 


We know that \” is a unique extension of that content on Bg. In 
light of Remark 9.4, we can look at a more general problem. Let us 
now consider an n-tuple of measure spaces (Qj, Xj, ji), 
1 =1,...,n. We wonder if there exists a unique measure ju on the 


a 


7 n 
measurable space ( oe 8,5) such that for each measurable 
=1 — 


rectangle A, 
L(A) = w(Al x «+ X Ap) = p1(A1):**n(An). O 


We will begin with a construction of such a measure to be re- 
ferred as a product measure. 
nm 
9.8 Definition. Let B be an arbitrary subset of 2 = [] 2;. We 
i=l 
define for a point a; € (2; the a;-section of B as 


By = {(w1, wee Wi-1,Wi4+1, Saga) 
E Q, Xr & Q 5-1 x a4 Xr & Q,: 
(w4, s+) Wi-1, Qi, Wit1,--- yn) = B}, (9.8) 


it aj is such that aj ¢ T;,(B), then (w1, wee Wi-1,Qj,Wit1,--- 
,Wn) € B and B,, = O. (See Figure 9.1 below.) 


Here 7;; was defined as 7; : 0 — Qy x 03. 


For the case of two factor spaces, for each a; € (2; we can forma- 
lize a;-sections as maps [Xx, P(Q2), Oa,] and [X'g, P(Q1), oa,] such 
that for any B € Ng, 
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Q, 


~ Gi 


13 (a)O B 


Q, 


B.. =T (15 '(45) OB) 


Q, 
Figure 9.1 
Ba _ Oa, (B) 
_ J {yeQe:(a1,y) € B}, a1 € m.(B) 
= 0. fin ate BY (9.8a) 
and 
Ba = Oa, (B) 
{7 € 1: (@,a2) € B}, ag € 7, (B) 
= 9.8b 
Q, a2 é Tox(B). ( ) 
O 


Without loss of generality we continue to work with two factor 
spaces. The lemma below claims that the codomains of the above 
maps in (9.8a-b) can be narrowed down to %’) and », respectively. 
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9.9 Lemma. Let o,, be aj-sections for some a; € 2:, 1 = 1, 2. 
Then they are the maps [X'g, 512, 0¢,| and [Xig, 11, 0a,|. In other 
words, for each A € 3'g, Oq,(A) is 3’y-measurable and o,,(A) is 
2/-measurable. 


Proof. Denote © = {A € Ng: oa,(A) = Aa, € Xo}. We need 
to prove statements (a) and (b). 


(a) As per Problem 9.4, ©” is a a-algebra in Q) ® Qz. The earlier 
Problem 9.3 claims that the section is commutative with all set 
operations which helps in proving assertion (a). 


(b) Any measurable rectangle is an element of »”’. This assertion 
is based on the property of the section 


Ag, EA 
Oa, (Al x Ag) = (Ay x Ao)a, — { a ¢ A, 


(9.9) 
(which is easy to prove using the definition of o,,), thereby making 
3), x XJ) a subset of 17’. By (a), 7” is a o-algebra, and the set of all 
measurable rectangles %'; x Xy is a generator of X's, therefore it 
follows that the latter is contained in ©”. On the other hand, due to 
the definition of 37’, 37’ C S’g. Therefore, &'’ = ’g and concludes 
the claim that o4,.(X¥9) C >. Of course, the proof that 
Oayx(Xig) C SX is identical. O 


We return to the problem of formalizing a product measure on 
Six posed in Remark 9.4. 


If j4; is a measure on XY; , 7 = 1,2, then according to Lemma 9.9, 
for an arbitrary set Ac 3 @ X, Ag, € Yo and A,, € X1, and 
thus 4;(Aq,) and ji2(Aq,) are defined terms. For a fixed A, j11(A,z,) 
and jl2(A;,,) can be regarded as functions of a2 and a;, respectively. 
The lemma below asserts that under some restrictions they are even 
measurable. 


9.10 Lemma. Let j1; and [lz be o-finite measures on 7; and 3, 
respectively. Then for a fixed set A € 3}, ® 75, the functions 


fa(a1) = oe(Aa,) and ga(az) = f(A) 


are 3)\-B,- and 3/5-B.,-measurable, respectively. 
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Proof. 


(i) We first prove this statement under the assumption that 12 
is finite. Let = {Ae XY, @ Ly: fa is %\-B,-measurable}. We 
show that 21 = 7; @ LX. 


a) For 2 =Qy x Qe and fo(a1) = p2(Q2) (is a nonnegative 
real number), we have that fo is {0,@}-6,-measurable, thus also 
»/,-B,-measurable, and hence 2 € 3’. Observe that the finiteness of 
jl is essential, because otherwise, we can only arrive at the weaker 
result that f4 is 57;-B,-measurable. 


b) Let A € 2. Then f4 is measurable and 
AS € 3 @ Lo. 
Then, due to Problem 9.2, 
fac(a1) = H2((A%)a,) = Hea((Aa)*) 
= pla(Q2) — p2(Aa,) = falar) — fala) 


is measurable as the difference of two measurable functions. This 
implies that A° € 2. 


c) Let {A,,} be a sequence of disjoint sets from 2. Then, again 
by Problem 9.2, 


fox ,Ay(@1) = H2((na1An)ar) = H2( Lina (An)ar) 
= Vinait2((An)ar) = nai fa, (a1) = supt Dina fa, (aa) } 


is clearly measurable. This implies that }>~° , A, € 2 and thus 2 is a 
Dynkin system. 


d) Now let A = A; x Ap bea rectangle. Then by equation (9.9), 
fa(ai) = [2(( Ay x A2)a,) = [2(Aa), ifa, € A\ and 


fa(ai) = 0, if ay ¢ Aj; 
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and therefore, 


fa(a1) = pe(A2)1a, (a1) (9.10) 


is 3\-B,-measurable as the product of a constant function /12(A2) 
and an indicator function of a measurable set A; € |. Thus, 2 
contains the set 37, x ) of all measurable rectangles which is /M - 
stable. On the other hand, %’; ® 5 is a Dynkin system generated by 
an (-stable generator, which implies 17; ® 3’) C 2 (as 2 is a Dyn- 
kin system just containing 7; x 3)). However, by the definition of 
QA, the family A C X', & YX. Therefore, 2 is a c-algebra and 


Y= Dd) @ Mo. 


(ii) Now we assume that jug is o-finite. Then there exists a 
sequence 


{Ok} C Ly 
such that 
O57 Q2 and p2(Q5) 


is finite and we can apply the above result to the trace o-algebras 
M O85, and state that f4 is 


= 1 O&-B,-measurable, 
where 


fi(a1) = [2(Aa, a) OF) and Aeé Dy ® Lo. 
Consequently, by continuity from below for juz, the function 
[2(Aa,) —_ fa — sup{ fi} = lim - b2(Aa, M 05) 


is 7,-B,,-measurable. oO 


Now, we come to a major result in this section about the 
formation of a product measure. 


9.11 Theorem. Let (Q);, 5’;, j4;) be two measure spaces and let 1; 
be o-finite measures. Then there exists a measure ,s on 3) ® Xo 
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such that, for each A€ 3 ®X%, (A) is determined by the 
formula 


= fp2(Aa,) pi (day) = f (Aa) (daz). (9.11) 

Specifically, for A= A, x Ao, 
p(Ay x Ag) = p1(A1)2(Ad). (9.1 1a) 
Proof. Let f4(a1) = f2(Aa,). Then, by Lemma 9.10, f4 is a 


nonnegative extended-valued »-B,.-measurable function. 
Therefore, the integral 


A + J fa(ai)pn (dar) (9.11b) 


is defined for any »’.-measurable set A and it is a nonnegative set 
function denotes by j1(A). We will prove that ju is a measure on ¥’g. 
Clearly 41(@) = 0. Using Problem 9.3 we can show that 


Fx, An — aeree 
for {A,,} as a sequence of measurable pairwise disjoint sets of 3’. 


Applying this equation to (9.11b), we have, by Beppo Levi's 
Corollary 2.2, 


Lu dina An) = J ae A, (G1) H1 (day) 
= pear J fade = pyaar ees? 
In particular, for A = A, x Ag, 


j(Ai x Az) = ffa(ai)p1 (dar) 
by (9.10) 


= f p2(A2) 1a, (a1) pa (day) = fy (A1)p42(Ad). Oo 
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The measure jz on 7; ® 2 1s called a product measure and it is 
denoted by pu; ® fg. Theorem 9.11 is readily extendible to the 
product measure & ji; on finitely many measure spaces. 


The uniqueness of the product measure is subject to the 
following. 


9.12 Theorem. Let (Q;, ©’;, 14;) be measure spaces such that G; 
are (\-stable generators of 33;, 1=1,...,n. If [4;'s are o-finite, 
then the product measure & [i on 6 Si is also o-finite and 


unique. 

Proof. Denote G = x : G;, which, according to Lemma 9.5, is a 
generator of Dig = © i. It is easily seen that G is  -stable. Indeed, 
using Problem 5.10 (a), Chapter 1, 


(Gi Kee XX Gi) a (A, KPee « A,,) 


in as much as each G; is M -stable. 


Because 1; is o-finite for each 7, there are sequences 
Git OQ, i=1,...,n, and k=1,2,..., 
such that ju;(Gi,) < co. Obviously, 


Q =T[Gint Q= TIM. 
j=1 i=l 


Furthermore, 
Gr) (Q*)) =< 111(Gin) 7 Ln(Gnk) < OO. 


Therefore, & jt; 18 o-finite. Now we recall again that the product 
‘= 


measure & 4; has the property 
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(,28 8) (hm) = mtr) 


for every measurable rectangle from »',. Specifically, this property 
holds for every rectangle selected from the generator G. If there is 
another measure ji on 5, with the same property on G, in other 
words, 8, i; = ji on G, then, by Theorem 2.13, Chapter 5 (on the 


uniqueness of the extension of a measure), these two measures must 
coincide on »',. This proves the uniqueness of the product measure. 
Observe that another measure on »', may exist. However, it cannot 
be a product measure. O 


9.13 Remark. Summarizing all results and formalism from 
Remark 9.4 throughout Theorem 9.12 we pass through the following 
major steps. 


(i) Given a finite family of measure spaces {(Q, %, pu;) : 
i =1,...,n} we defined the set x +’; of all measurable rectangles 


n 
which turned out to be a semi-ring on 2 = [| 9;. 
i=1 


(ii) We defined on x 37; an elementary content ju? by 


Me (R™) _ [1(A1)---n(An), (9:13) 
which was extendible to a content 7i on the algebra A ( x x) of all 


n-dimensional figures F’ a: res as a finite union of disjoint 
rectangles: F = 37" ,R\” so ae ‘ele such figure could be 


eee dee TP HR). 


(iti) An extension of 7i from A(X, x) to 3. was rendered ex- 


plicitly via integration. More precisely, the common transition from 
the elementary content on the semi-ring to the content and then a 
measure was cut short by defining the measure 


[(A) = fp2(0a,(A)) ua (dar) = fu (oa,(A)) (daz) 
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on S'g in (9.11) that agreed with the content ju? (9.13) when jz was 
applied to rectangles. To avoid bulky notation, the results were 
established for two factor spaces. 


(iv) According to Theorem 9.12, measure ju turned out to be o- 
finite, which guaranteed the uniqueness of such a product measure ju, 
also in notation & L4;. Of course, we should have mentioned that 

= 


j4;'8 Were assumed to be o-finite to begin with. 


(v) In conclusion, we note that although some elements of the 
Carathéodory extension were present in the formation of the product 
measure, it was not exactly the one we exhibited in Section 2, 
Chapter 5. For example, in one case of the product measure on Borel 
a-algebras in Euclidean spaces, the Borel - Lebesgue measure \” on 
Bb” and the product measure 8. A; on Bz must be identical (Problem 


mM 


9.8). However, the Lebesgue measure \j* on the Lebesgue o-algebra 


£*(R") and the product Lebesgue measure & A; do not coincide, 


as we learn from the example below. This was one of the reasons 
why we cared to distinguish the Borel- Lebesgue and Lebesgue 
measures. It would be fruitful to further investigate the differences 
between topologically induced o-algebras and Borel or Borel-like 
measures compared to those stemming from Carathéodory 
extensions on finite product spaces. However, this is beyond the 
scope of this book. O 


9.14 Example. Recall that the Lebesgue measure Aj is complete 
on Lebesgue o-algebra £*. Consider Lebesgue measure \j on £*(R). 
We show that 5 ® Aj is not complete. Let @ be any subset of R that 
does not belong to L*(R). Then, {0} x Q ¢ L* @ L* or else, by 
Problem 9.4, Q would be an element of £*. On the other hand, 
{0} x Q is a (proper) subset of {0} x R. The latter set, by Problem 
3.1, Chapter 5, is a measurable Borel null set. Hence, {0} x Q is a 
negligible non measurable set in L* & L*, although it is measurable 
in the Lebesgue space (R?, £*(IR”), \?*), known to be complete. 
Hence, the space (R*, £*(R?), AZ") does not coincide with the 
product measure space (R?, L* @ L*, Aj @ Xj). Obviously, the same 
discrepancy takes place in the n-dimensional Euclidean spaces 
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(R”, £*(IR”), Ag”) and (Rr, B £1 a), Mi) in contrast with its 
Borel - Lebesgue counterpart. O 


After we define the product measure on a measurable product 
space we turn to multiple integration of measurable functions on 
product spaces with respect to the product measure. 


9.15 Definition. Let f :0Q) x Q2 — Q3 bea function, Q = Q; x 
Q2 x Qs and let 71, 72 and 73 be the corresponding projection maps 
in). Let 


7123: Q = QQ» x Qs 
be the projection map defined as 723(w 1, w2,w3) = (w2,w3). For a 


distinction, we now use a different character for the section operator 
applied to functions from that we used for sets. Denote 


gaof = fa, = T23x(™j(a1) N f) 
and call it the a,-section of f. In other words, 


Pa of = {(We, f(ai, w2)) swe € Oo} 
or also in the form 
fa, = {(we, w3) € Og x N3: (a1, we, wW3) € f}. 
The ay-section is defined similarly. O 


9.16 Proposition. Let (Q4 x Qo, 5; @ Xy) be a measurable 
product space, (Q3,5'3) be a measurable space, and let f : 
Q) x Qg — Os be a B® Xy-L’3-measurable function. Then the 
section fy, is 3/9-/3-measurable and f,, is 3/,-'3-measurable. 


(See Problem 9.13.) 


The theorems below resemble some of the original versions of 
Fubini's theorem. Theorem 9.17 (which by many is referred to as 
Tonelli's theorem) states in essence that if the integral of a non- 
negative extended-valued »' ® 3’5-measurable function is finite, 
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then it can be calculated by using iterated integrals of formula (9.17) 
below. To check whether an arbitrary »'; @ 3/)-measurable function 
f is integrable one can also apply Theorem 9.17 directly to |f| by 
using formula (9.17). 


9.17 Theorem (Tonelli). Let (Q;, Xi, Hi), = 1,2, be o-finite 
measure spaces and let f € Co1(Qi x Oo, 01 @ 7; 1R, B(R)). Then 
the functions 


art J fa, (W2) dui2(we) and ag f fa,(w1)dpn(w1) 
are 5- and S'y-measurable, respectively, and 
Sfdpr ® po = ff fa; w2)dpe(we))d pn (a1) 
= JCS fa, (1) dps (1))dpt2(a2). (9.17) 


Proof. 


(i) As usual, we begin with nonnegative simple functions. Let 
s= or aily,, Ai € 21 @ Xb. 
By Problems 9.5 and 9.14, 
Sa; (W2) = D371 0u(14,)q, (2) = WL ail (ay, (w2), 
and the latter is measurable and 
J 8a,(w2)dpo(w2) = SOP aupe((Ai)a,)s 


which is %',|-B,-measurable by Lemma 9.9. Hence we may integrate 
J Sa,(w2)dju2(w2) with respect to measure 1: 


I (Sa (w2)dpir(w») J dpi (a1) 


= 1% J H2((Aa)a,) Epa (1). (9.17a) 
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By Theorems 9.11 and 9.12, equation (9.17a) reduces to 


oO fl ® f2(Aj) = [sd ® po. (9.17b) 


(it) Let f € Ci'(Qy x Ng, ©) @ Ly). Then there is a sequence 
{s,} such that {s,} ? CW4(Q1 x Q2,C~+) and that f = sup{s,}. 
Thus 


Jfdp1 ® p2= supf{ fs dp ® jun} 
(by (9.17a) and (9.17b)) 
= sup J (J (Sn (W2))a,dp2(we2))d pn (a1) 
(by the monotone convergence theorem) 
= J (sup ($n(w2))a, U2 (we) dpi (a1). 
By Problem 9.12, finally, 
Jfdpr ® pe = JCJ fa, (w2)dp2(we))dpn (a1). 


Now it is obvious how to complete the proof of formula (9.17). O 


9.18 Theorem (Fubini). Let (Q;, ©;,4;), 7 = 1,2, be o-finite 
measure spaces and let f € L'(Q) x Qo, 51 @ Dy, [1 ® pg; R). 
Then the functions 


wo fa (wo) and wi fa,(w1) 


are [1-integrable j1,-a.e., and j.,-integrable jio-a.e., respectively; 
the functions 


ay +> J fa, (we) dp2(we) and ay > f fa,(wi)dpi(w1) 


are |11-a.e. and [19-a.e. defined, and they are j1,- and |19- integrable, 
respectively, and the formula 
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Sfdpn ® we = Jf fa, (w2)dpe(we)) dyn (a1) 


= JS faz (w1)d pn (w1))d pez (a2) (9.18) 


holds. 


Proof. Because by the assumption of the theorem, 


7. E DQ, x Qe, a1 @® D2, [1 ®& 2; R), 


we have by Problem 9.13 and then by Tonelli's Theorem 9.17 
applied to | f| that 


S\fldin ® po = ff \ fa, (w2)|dp2 (we) dps (a1) 
= J (J fa: (w1)|d p11 (w1))dp2(a2) < 00. 


By Proposition 1.21, the functions a; +> [| fo,(w2)|dpi2(w2) and 
dy +> {| fas(w1)|dui(w1) are jy- and j12-a.e. finite, respectively. In 
other words, for almost all ay € Q) and ag € No, | fa,| and | fa,| are 
jlg- and j.,-integrable, respectively. It is easy to verify that the same 
applies to the pairs of functions (fu,)~, (fa,)” and (fa,)*, (fa,)- By 
Theorem 9.17, the pairs of functions 


ay [(fa,)* (we)dp2(we), ar [(fa,)” (w2)d p12 (we) 


and 
ag b> Ue (w1))* dp (1), ag b> JF (fas (w1)) dpa (w1) 


are 4/|- and 4’5-measurable, and they are ju)-a.e. and ji2-a.e. defined, 
respectively. Finally, applying formula (9.17) to f*, (fa,)*, and 
(fa,)* and then to f~, (fa,)”, and (fa,)”, we arrive at formula (9.17). 

O 


9.19 Remarks. 


(i) Fubini's theorem generalizes Theorem 9.11 because for 
f =1,4 and A€ X ® X4, the result of Theorem 9.11 immediately 


9. Measures and Integration on Finite Product Spaces 517 


follows. This is why Theorem 9.11 is also referred to as Fubini's 
theorem. 


(22) Observe that Tonelli's and Fubini's theorems differ not only 
in that they are applied to nonnegative and arbitrary measurable 
functions, respectively, but also that functions on a product space 
belong to L!-space is a conclusion in Tonelli's Theorem whereas it is 
a hypothesis in Fubini's theorem. 


(iit) The above results (including Fubini's theorem) can 
naturally be extended from two spaces to any finitely many spaces. 
This involves a relatively cumbersome notation and is not discussed 
further except by way of examples. O 


9.20 Definition. Let (Q;,%;,;), i=1,...,n, be measure 
spaces with o-finite measures. Then the triple 


n 

i=l 1=1 4=1 

is called the product (measure) space. O 
9.21 Examples. 


(i) Let C,(2o,1r) be a closed ball in (R",7,). Denote V, = 
A"(C;,(8, 1)). We wish to show that A"(Ci,(xo, r)) = Vr - 7”, where 


Vag = ait, R= 1,2, 05 (9.21) 
and 


k 


Vor-1 = ao Ts BS en « (9.21a) 


A closed ball is clearly a Borel set. Let L,,(r, 29): R"” — R” be the 
bijective map, 


Lr; 29) 2) = Te + 20; 


where x, 2) € R” and r>0O. Then, C,,(6,1) = L*(r,20)(Ci(r, 
x0)). Observe that 
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L*(r, a9) = E(;) 0 M(— 2»), 


where EF means expansion with factor i and M stands for the 


parallel motion (i.e. translation, here with the shift — 29). Now, 


C,(8,1) = {xe R": || 2 ||? < 1} 


nm 
es ee Soa? < 1-2? —- 23, with 


(x1, 22) € C2(0, 1)} 
= Cn-2(8, 1-2} — 25) 
= Li_4((1 — 2] — 3)”, 0) (Cr-2(9, 1)) , 
where (21, 2%) € C2(6,1). This yields 
1 (91) (£15--., Zn) 
— 10,5 (6,\/T=a=a2) (13, ..-)2n) + 1¢,¢0,1)(21, £2) 
= Ups (1-22 22)1/2,8)(C,-2(6,1)) (3, +++ En) 
x loa) (21, a): (9.21b) 
Furthermore, 


Vn. [ Aa) = f1c, 9.1)(@)(A"— 2 @ \*) (dz) 
C;,(8,1) 


[by Fubini's theorem and by (9.21b)] 


= Jle,(0,1) (#1, £2) 
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n—2 2 
X J Ups_,((1—a22—23)"/2,0)(Cy_o(0,1)) OA” A", £2) « 


The interior (second) integral is, due to Proposition 4.3, part 1), 
Chapter 5, and the above observation, equal to 


f1c._.t61) ° E( - at — 02)—V?)\ (a3, ...,2n)dA"-? (23, «.., En) » 


By Proposition 4.3, part 2), Chapter 5 and by Theorem 4.1, the last 
integral equals 


(1-2? - 4 ll a Vas: 
Therefore, 


Vi = Vn-2 fl — vt — 23)? 16,041) (#1, 22) dA? (21, £2) 
(by Fubini's theorem) 
= Va» ff — 2} — 23)? 16,91) (a1, 22)dA(a1)dX(a2) . 


This is a Lebesgue integral of a continuous function on the unit ball 
and it can be reduced to a Riemann integral by using conventional 
techniques for Riemann integrals. For example, the double integral 
above is then 


ff eA = ep?) dédp = =, (9.21c) 
pela,z] 0€ [0,7] 
and thus 
Vi = Vad ey SB tae (9.21d) 


n? 


Let Vo = 1. Then, Vi = 2 (as the Lebesgue measure of the interval 
[—1,1]). By (9.21c), V2 = a (which agrees with the definition of 
Vo). 


2 2? 2-1 7 
Y3=VU35= 77 .and Vy= F. 
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The validity of formulas (9.21) and (9.21a) is then easily shown by 
induction and the use of (9.21d). 


(72) We show that Fubini's theorem need not hold when at least 
one of the measures, f; or jig, is not o-finite. Let (Q;, X;) = 
({0, 1], B((0, 1])), ¢@ = 1,2, por = Resjo1jA, and 2(A) = |Al, if A is 
finite and f2(A) = oo, if A is infinite, where A € 35. Denote 


De { (#1, £2) € (0, i? Q= x2}, 


the diagonal of the square (see Figure 9.2 below). We show that 
DEX) @ Ly = B?((0, il 7), Let 


and 


nm 


LAs = U (Brie x Snk)e 


k=1 


D x 


Figure 9.2 


Then D € X ® X» for D = () An. Now we find 


n=1 
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J t2(Dz)A(dx) = A((0, 1]) = 1 (because pi2(D,) = 1). 
0,1 


On the other hand, 


J A(Dy) 2 (dy) = 0. 
(0.1 


So as we see, Fubini's theorem or more precisely, the second 
equation in (9.11) of Theorem 9.11, does not hold. 


(iii) Let (N, P(N), 7) be the counting measure space introduced 
in Example 1.2(vi22), Chapter 5, for more general measure spaces. 
We consider a sequence {s,,} of nonnegative simple functions on N 
as 


Sn = oly, 2 =0,1,..., 
where {a;,} is a sequence of nonnegative reals, so that 
g = sup{sn} = Ooi gy. 
Hence the integral of g turns to the series: 
fgdy = sup fs,dy = 77 ai. (9.21e) 
This is readily extendible to a series with real-valued terms. In other 


words, the integral of a sequence {a,} CR with respect to the 
counting measure 7¥ is represented by the series in (9.21e). 


Let {f,} be a sequence of nonnegative functions of C,'(, 2) 
and let ys be a o-finite measure on 2’. The above counting measure 
is o-finite, therefore the function f [where f(n,w) = fr()] 
obviously meets the conditions of Tonelli's Theorem 9.17: 
f €Cz'(N x 2, P(N) ® XY). Consequently, the sections 


es J fr(w)du(w) and a +> (fig ,a)dy 
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are P(N)- and ¥’-measurable, respectively, and 


ffdy@p= Te Sfrdu=SO2ofrdp. (9.218) 


Although (9.21f) is a nice illustration of Tonelli's theorem, it is a 
slightly weaker alternative to Beppo Levi's Corollary 2.2, since the 
latter does not require ju to be o-finite. 


Now, let {fn} €C1(Q, ©’). To use an analogue of Fubini's 
theorem, we need to make sure that the function f belongs to 
D1(N x 0, P(N) ® 2,7 @ pw; R,), or, alternatively, apply the above 
procedure initially to the sequence {| /f;,|} instead. Then, from (9.21f) 
we can get 


S\fldy@ w= Dros lfnldu 
= Jol fale. (9.21g) 


Should now, f{|f|dy® ys or their equivalents, 7 /|fnldj or 
Jor ol fnldy, be finite, it would yield that 


fe L(NxQ, P(N) ® 2,72 u;R) 


and therefore, Fubini's formula (9.21f) would hold true, now for an 
arbitrary sequence of measurable functions {/f,,}. Notice that, 
because 


Sl fldy @ wu < 00 


is a necessary condition for f €¢ L'(N x 0, P(N) @ Y,7® p;R), it 
automatically implies that 


4) eee (9.21h) 


Jol fold < CO 


or 
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would be alternative necessary conditions for f to belong to the 
space L'(N x Q, P(N) @ Y,y @ pw; R) (although the latter is in no 
way a necessary condition for Fubini's theorem). In some 
applications, this version of Fubini's theorem can challenge the 
generalized monotone convergence Theorem 2.4 and Lebesgue's 


dominated convergence Theorem 2.6. 


(tv) As an illustration to the last application of Fubini's theorem, 
consider a r.v. X on a probability space (Q, X’,P). The function 
m(0) ++ Efe’*] (normally, complex-valued) is known to be the 
moment generating function of X. If we formally expand e?* in the 
Maclaurin series, 


Ox _ co 6" yn 
€ ~~ Dip ni 
assuming m(@) is analytic at zero, we have with 


m() = Ely nmo nt X"| 


a precedent for the application of Fubini's theorem discussed in 
Example (227). Hence, we have to make sure that, in light of (9.21h), 
the series 


\0O 6p | |0|” co =|d|", n 
rok S1xi] = De Sex" 


n=0 n! 
is convergent in some vicinity of ? = 0. [The latter holds if m(6) is 
analytic at zero yielding that E[|X|"] < co for all n.] Assuming that 


all absolute moments of X exist and the above series does converge, 
the application of Fubini's theorem (9.21f) yields that 


m9) = or.) SELX"I, 


n=0 nl - 


as a Taylor series expansion of m(@) in terms of all moments of the 
random variable X, and consequently that 


E[X”] = m™(0), n=0,1,.... 
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(v) Consider Borel - Lebesgue measure \? =  ® A on Borel o- 
algebra B?. Let A = Q x R. According to Problem 3.1, Chapter 5, A 
is a countable union of Borel-null sets. Thus, 


(A) = fladd? = 0. 


On the other hand, the section (1,4),, is not A-integrable for all 
a, € Q. This is, however, in agreement with Fubini's theorem that 
the function (1,4),, is A-integrable only for almost all a, € R. 


(vz) Now we discuss yet another application often occurring in 
probability theory. Let jzp and jug be two finite Borel - Lebesgue - 
Stieltjes measures induced by distribution functions F',G € D(R, B) 
(see Remark 3.5(i22), Chapter 5). Recall that 


bitty» 6 (o) = Hie) = 0; 


From Problem 3.7, Chapter 5, given a compact interval J = [a, }], 
we have that 


pur @ pg?) = prc) 
= [F'(0) — F(a — )I[G() — Gla — J. (9.211) 
Let T, = {(x,y) €[a,b??: y>a} and T= {(z,y) € [a,b]? : 
y <a} be the upper and lower triangles of the square 1°, 
respectively. Now we calculate the measure of J? under pp ® pq by 
using Theorem 9.11 in terms of Lebesgue - Stieltjes integrals: 


Ur ® uc([a, b]*) = ur ® uc) + br @ uc(T.) 


= J Lc([a, x))ur(dz) + fur(La, y))uc(dy) 
I 
= |[G(x) — G(a — )]ur(dz) 
I 


+ fLF(@ —)— F(a — )luc(dz). (9.21)) 
I 
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Equating (9.211) and (9.21j) we arrive at 


[G@)ur(dx) + [F(@ —)uc(dx) 
I v8 


= F(b)G(b) — F(a — )G(a - ). (9.21k) 
Interchanging the roles of F' and G' we have 


[Ga —)ur(de) + f F(@)ue(de) 
i I 
(9.211) 
= F(b)G(b) — F(a —)G(a—). 


Hence, from (9.21k) and (9.211) we establish the following 
integration by parts formula for Lebesgue - Stieltjes integrals. 


F'(b)G(b) — F(a — )G(a — ) 


= fo{F(a) + F(a —)}ue(dz) 
I 


+ [3{G(a) + G(w — )}ur(dz). (9.21m) 
I 
Oo 


9.22 Remark. In Chapter 5, sections 1, 4, and 5, we gave a rather 
extensive treatment of the notion of independence of families of 
events and r.v.'s. Let (0, ©’, P) be a probability space and X; be a 
(42, 27 )=( Oe, 07) O05 9 = 1.1.90. Recall that rv.'s Xj, ..:.,X,, are 
independent if the o-algebras {o(X1),...,0(Xn)} generated by 
these r.v.'s are independent. We have a new look into this notion. 


First we denote © Xj = {X1,...,Xn} iM x xO 


and call it the product map. Unlike Definition 4.12, Chapter 3, here 
the product map of functions is defined slightly differently: 


® Xi(w) = (Xi), + Xn(w)) = Wry -- Wn): 
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Consequently, we specify 


i=l tl 


r 
(8x:) (4) = (iW)... Kalu) we AACN 
and 
fi. (8%) 
i 


= {Xi ({wi}) 1+ XA ({wn) : Wr ---1n) € BH, 


BCI. (9.22) 
In particular, if B= B, x --- x By, 
(3 xi) (By = AGB) Ti (B,,). (9.22a) 
1 


Indeed, if 


we have 
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> w € X*(Bi)N... N.X*(Bn). 


It appears (Problem 9.22) that the product map & X; is L- 
&, 57;-measurable. Therefore, by 


we can define a probability measure on (i oes & ;) and call it 
i=1 += 


the joint distribution of r.v.'s X1,..., Xn. 


Denote Px, = PX; the distribution of the random variable X; , 
i = 1,...,n, which is a probability measure on %;. Then, according 
to the eoneicnan we deal with in this section, we set the probability 
product space 


n n n 
I] ae ca J = Px, : 


i=1 


On the other hand, we already have another measure Pzx, on 


(i! 2%, @ x) which, in general, need not be a product measure. 


i=1 


The following statement sheds light on this dilemma. O 

9.23 Theorem. The joint ae distribution Pax, is a 
product measure and equals | 8, _ if and only if the rv.'s 
X1,...,Xn are independent. i 


n 


Proof. (7) Suppose Pzx, = 2 Px,. Then, as the product mea- 


sure, & Px, has the property that is factorized at all measurable rect- 
— 


angles, namely, given sets Ay € %},..., An € Xn, 


@ Px(Ai x +++ x An) = Px, (At): Px, (An), (9.23) 
which by Problems 4.11 and 4.12 means that X,,...,X,, are inde- 
pendent. Indeed, by the assumption, the left-hand side of (9.23) is, 
according to (9.21a), 
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(ii) Suppose X4,. 
lem 4.12, 


Pe x,( At Xr & An) 


nr (22) tas 
= | xia) een X;(A0) 
= Px,(A1)---Px, (An). 


nm 
On the other hand, the product measure ® 
i=1 


property on rectangles (9.23), meaning that Q 
i=1 
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(9.23a) 


.., Xn are independent. Then, according to Prob- 


x, has the factorization 


x, and Pax, co- 


incide on x »/;. By Theorem 9.12 the two measures must be equal 
i= 


also on the product o-algebra ® Ji 
i=1 


PROBLEMS 


9.1 


a 


Let Go be the set of all measurable rectangles A = A, x --- 


x An, A; € X;. Denote by (Go) the algebra generated by 
Go and by C(Gp) the collection of all finite unions of disjoint 
rectangles of Go. Show that 7(Go) = C(Go). 


9.2 Prove that 


Eq,(A1 X Az) = (At X Aaya, = { 


Ao, 
0, 


a, € Ay 
a, € Ay” 
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93 


9.4 


9.5 


9.6 


9.7 


9.8 


9.9 


9.10 


Prove that the section is commutative with respect to all set 
operations. 


Show the validity of assertion a) in Lemma 9.9: Let oq, be a;- 
sections for some a; € Q;, i =1,2.Denote 37’ = {AE Xe: 
£a,(A) = Aa, € 2}. Then X" is a o-algebra in Qy ®@ Qe. 


Let AC Qy X Qe and let a; € 2). Show that (14), = 14.3 
also in notation, Y,, 014 = 1, ( A): 


Show that the Borel- Lebesgue measure X” on 6” and the 
product measure & A; on Be must be identical. 


Let (Q;, 27;),i = 1,...,n, be measurable spaces. Prove that 


that 


Let (R”,B", A”) and (R*,B*,\*) be the Borel - Lebesgue 
measure spaces. Show that 


( RM x Re B” Q BY. nN Q ey = (R™**, Br, TE), 


Let (Q;, Xi, ui), 7 = 1,2, be measure spaces with o-finite 
measures and let Ac %', ® Xy. Show that the following 
statements are equivalent. 


1) 41 @ f(A) = 0. 
2) p2(Aa,) =O pi-a.e. on D4. 
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9.8 


99 


9.11 


9.12 


9.13 


9.14 


O15 


9.16 


9.17 
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3) w1(Ag,) = 0 po-a.e. on Qo. 


Let AC Qy Xx Q2 and let ay € 2). Show that (14)a, = 14,3 
Show that ee) = (f* (Asa: 5 A3 @ Qs. 


Let (Q;, 2, i),2 = 1,...,n +k, be measurable spaces, such 
. n n+k 
that j1;'s are o-finite measures. Suppose & His ® pj, and 
| i=n+1 
n+k nt+k 
& jt; are product measures on product spaces ® Diy @ Dey, 


i=1 i=n+1 


and” S 5/;, respectively. Prove that 
i=n+1 


Show that f7(A3) = (f*(A3))a,, A3 © Qs. 


Prove Proposition 9.16. Let (Qy x Q2, X, @ 32) be a measur- 
able product space, (Q3, 3) be a measurable space, and let 
ff: Qa x Qe — OQ3 be a XY @ 39-5'3-measurable function. 
Then the section fa, is 3/-5’3-measurable and f,, is 3-5/3- 
measurable. (Hint: Apply Lemma 9.9 and Problem 9.9.) 


Show that €,,o(af + Gg) = a&a,of + B&a, og. 


Let f € Cy'(Q) x Qo, X', ® Xy) and let a sequence of simple 
functions {s,} C W,(Q) x Q2,C~') be such that f= 
sup{s,}. Show that f,, = sup{(S,)a,} (Hint: Apply Theorem 
6.4, Chapter 5, and Problem 9.14.) 


Show that |f|, = | fa|,(f")a = (fa)™, and (f-)a = (fa). 


Let G, and Gy be semi-rings on (2; and Qo, respectively. Is 
G, X Go also a semi-ring? 
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9.18 


9.19 


9.20 


9.21 


9.22 


Let ju; and 1; be finite positive measures on measurable spaces 
(Q;, 7), 7 = 1,2. Show that ifv; < yj, i= 1,2, then 4 ® % 
< [41 ® [l2, where both product measures are on the product 
space (Q) x Ng, X ® HX). 


Let (0, 7,41) be a o-finite measure space and let f € 
Cz'(Q, Z). Prove that 


J fdp = fut = ep)ACdz). 


(Hint: Use Theorem 9.11.) 


Under the condition of Problem 9.19, let g: Ry — Ri, bea 
continuous monotone nondecreasing function such that 
g(0) = 0 and which is continuously differentiable on (0, 00). 
Show that 


fo Adu = (Lf g(@)utf > «})Adz) 
(0,00) 


=R)fo gf @)e(f = x})de. 


Show that if F and G in Example 9.21 (vi) have no common 
discontinuities, then formula (9.21m) reduces to 


F(b)G(b) — F(a — )G(a —) = J F(x)uc(dz) + [G@)ur(dz). 
I I 


(P9.21) 


Show that the product map 8 Xi introduced in Remark 9.21 


: n 
is 3/- ® »;-measurable. 
= 
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Chapter 7 


Solutions to Selected Problems 


Chapter 1. Set-Theoretic and Algebraic Preliminaries 
1.5 Prove that: 

a) (A\B)° = ACUB. 

b) [((A°U B)° U (AU B®)]f = B\A. 

c) (AN B)U(AN B®) U(A*NB)=AUB. 


Solution. 


a) Follows from A\B = AN B* and the application of the idempo- 
tent and De Morgan's Laws. 


b) Using idempotent and De Morgan's laws we have 
[(AS U B)* U (AU B®)]* = (ASU B)N (AS NB) 

(by associative and distributive laws) 
=(A-UBniAyYNB 

(because BM A° is a subset of A°) 
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= BM A® = B\A (by Problem 1.3). 
c) (AN B)U(AN B’)U (ASN B) 
= (AN B)4+(A\B)+(B\A) = AUB. O 


1.6 For each of the following, justify with a proof or give a coun- 
terexample. 


aJAUC=BUC = A=B, 
b) (AU B)\B=A. 

c) A\B=C\B = A=C. 
d) A°\ Be = B\A. 


Solution. 


a) Is not true. For instance, let A and B be arbitrary, but different 
subsets of C’. 


b) Is not true: (A U B)\B = A\B. 

c) Is not true either. For example, let A = B+C and none of the 
sets is empty. Then, A\B =C and C\B=C. However, A#C. 
Thus, A\B = C\B. 

Histiet A\e* =A ip =|B A, C 


19 Let {A,: n=1,2,...} be an arbitrary sequence of sets. 
Define Ap = () A, and Ax = LU An. Prove that 


n=1 n=1 
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Solution. 1) Let x € limA,. Then, by the definition, x €) ) Ax. 


n=1 k=n 


[e.<) 
In other words, there is an no such that x € {] Ax. Since A, C 
k=no 
ee) 


CO [o.@) 
LJ Ag, Anti © Ax,..., it follows that also x €U Az, rE 
k=1 k=2 i 


LJ Ap,..., and hence, « € () LU A; = limA,,.. Consequently, the 
k=2 n=lk=n 


middle inclusion holds. 


2) Clearly, Ag C Ap U (14x) U (fr) U--- = limA,,. 
k=2 k=3 


Co 


3) Obviously, An. D An MN ( U Ar) Nn ( U As) Q---=limA,. 
k=2 k=3 


1.10 Let 2 be an arbitrary set. Find a sequence { F,,} of subsets of 2 
such that 


lim E, = @ and lim £, =. 


Solution. Let Eo, = Q and Eo,-1 = @. O 


2.4 Show the equivalence of the following statements. 
a) f is injective. 


c) For every pair A and B of disjoint sets, f,(A) 9 f.(B) 
= @. 


What can go wrong with b) if f is not injective? 
Solution. We begin with some preliminaries. The problem is 


essentially based on Definition 2.1 (vii) of f,(A) and it can be 
interpreted as follows: 
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f.(A) ={y © Y:datleastonex ec A: f(x) =y}. (P24) 
Thus, for each x € A, f(x) € f,(A). 


(i) We first show that f.,(AM B) C f,(A) 7 f.(B) regardless of 
whether or not f is injective. Indeed, let y € f,(AM B). Then, by 
(P2.4), there is at least one point x € AM B which is associated with 
y through f, ie. y= f(x). Then, ce A & x € B implies that 
y= f(x) € f.(A)N f.(B). 


(it) Now, assume that f is injective and show the inverse inclusion 
f(A) fe(B) S fe(AO B). 


Let y € f.(A)N f.(B). Then, y € f,(A) and y € f,(B). By (P2.4), 
there is at least one a € A such that y = f(a) and least one b € B 
such that y = f(b). In general, a and b can be different, but because 
f injective, a=b, say x. Thus, re A & cE B =>xEANB 
=y = f(x) € f.(ANB). 


Parts (7) and (iz) prove: a) = b). 
(iti) b) => c) is trivial. 


(iti) c) > a). We show that f(x) = f(x2) implies x; = x2. Let 
A = {x1} and B = {x2}. Then, AN B = @, and by assumption c), 
fe({21}) 1 fe{22}) = @. 


(iv) Suppose f if not injective and let y € f.(A)M f.(B). Since 
y € f,(A), there is ana € A such that y = f(a) € A and there is a 
b € B such that y = f(b) € B, and because f is not injective, a 
need not equal b. If so, {a, b} need not be a subset of AN B and con- 
sequently, 


{y} = fe({a, bf) E f(A B). 


The latter means that y ¢ f,(AM B) and thus 
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f(A) f(B) E f.(AO B). 


An easy counterexample illustrates the discrepancy of b). Let f(x) 
= x? and A = ( — 3,2) and B = (1,4). Then, 


ANB= (1,2), f(A) = [0, 9), f.(B) a (ly 16), 
f(ANB) = (1,4), f(A) 1 f(B) = (1,9). O 


In the following problems we assume that f is a map from X = Dy; 
into Y. 


2.6 ShowthatVB CY, f.(f*(B)) CB. 
Solution. If y € f,(f*(B)), then there is an x € X such that 
x © f*(B) and f(x) = y. But x € f*(B) implies that y = f(x) € 
BiY CB, O 
2.7. Show that [X, Y, f] is surjective if and only if 
fe({*(B)) =B, VBCY. 

Solution. 

(2) Let f be surjective and B C Y. Then, for y € B, there is an 


x € X such that f(x) = y. (Notice that unless f is surjective, such 
an x need not exist, ie. f*({y}) can be .) Hence, 


ce fy} So f(B) > ce f(B) 
=> y= fae f(f*(B)). 


In as much as it holds for each y € B, B C f,(f*(B)). The inverse 
inclusion follows from Problem 2.6 for all functions. 


(iz) Let f.(f*(B)) = B. Then, in particular, for B = {y}, 
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f.(F* (yh) = ty} = Wwe Y, PUyp #@ 
and thus f is onto. O 
2.8 Show that the map [P(R;), P(D,), f*] is injective. 


Solution. Suppose B, # By € P(Ry) and suppose without loss of 
generality that there is a y2 € By such that yo ¢ B,. Then, obviously, 
sets f*(B,) and f*({y2}) are disjoint [or else there is an x belonging 
to both sets such that f(a) € By, and f(x) = y2 which means f is a 
multivalued function]. However, this is impossible if f*(B,) = 
f*(Bo). o 


2.10 Is the map [P(Ry), P(Ds), f*] surjective? 
Solution. No, it is not. Take for instance the map 
[Xx a {X1, to}, = {y}, f] 


such that f(a1) = f(x2) = y. Then, f*(P(Y)) = {X,@} which is 
a proper subset of P(X) and does not recover {2} and {x9}. O 


3.1 Prove Theorem 3.1 under the condition that f is an into map. 
Let |X,Y, f| be an into map and let B C Y. Then 


[f*(B)) = f*(B*). 


Solution. Suppose BM Y’ # @ and let B’ denote the trace of B on 
Y’. By Theorem 3.1, applied to [X, Y’, f], we have that 


[PB = (BY), 
where B%, denotes A\B. Obviously, f*(B') = f*(B). Thus, 


LPB = (FBO = PCB) 
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and because f*({y}) = @ for all y € Y\Y’, the latter is also equal 
to f*(B°). The case with B C Y\Y" is trivial. Indeed, f*(B) = @ 
and [f*(B)]% = X. On the other hand, Y’ C B°, and 


PRH=TOyex 
Combining the above arguments we have, 
PCB) Ff = xe =O = 7"). O 


3.4 Let [X,Y,f] be an into map and let {B;:1¢€J} be an 
indexed family of subsets of Y. 


a) Prove that. {*( () B;) = () f*(B)): 


ie. ie. 


b) If B, and By are disjoint, show that their inverse images 
under f are also disjoint. 


c) If {B;: i € I} is a pairwise disjoint family, show that 
f* (YierBi) = Lier S*@Bi. 
Solution. 


a) By the idempotent law, 


r(g)=[te(ge)}] 


by Problem 3.1 and De Morgan's law 


-[r(yn)] 


by Problem 3.3 


= ( Ure) 


iel 
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by Problem 3.1 


=(uirwar) 


iel 
and by De Morgan's and idempotent laws 
=) f*Bi). 
iel 
b) Using a), f*(Bi)N f*( Bo) = f*(B1N Bo) = f*(O) =O. The 
latter is due to Example 3.2. 


c) f*;_-Bi) = U f*(Bi) holds true for an arbitrary sequence of 
tel 

sets under the union, due to Problem 3.3. That the right-hand side 

equals 5°,_, f*(B;) is due to b), because {f*(B;)} is a family of 

pairwise disjoint sets. O 


3.6 The results above prove that all set operations are closed under 
the inverses of maps. Show that not all set operations are 
closed under maps as per the following. 


a) Show that maps preserve inclusions. 


b) Show that maps preserve unions. 


c) Show that maps do not preserve intersections; specifical- 
ly, show that 


f.(MAi) © AAD) 


iel ie. 


and that the inverse inclusion need not hold. Explain the 
latter with and without a counterexample. 


d) Do maps preserve the difference? 


Solution. 
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a) Let A C B and let y € f,(A). Then, there is an x € A such that 
f(x) = y. From A C B, it follows that x € B and thus y = f(x) € 
f.(B). Because y = f(x) was assumed to belong to f,(A) and 
y € f.(B), we have the pledged inclusion. 


Here is another proof. 


Let A C X (= Dy without loss of generality). 
(i) Let y € f,(A). Then, obviously f*({y}) NAA @. 
(ii) Hey ¢ f(A), then, f*({y) NA =O. 


Now, if given AC B, f,(A) Z f.(B), then there is at least one 
point y € f,(A) that does not belong to f,.(B). Now, this leads to a 
contradiction due to (7) and (77), in as much as f*({y}) intersects A 
but not B. 


[Notice that if f is inconsistent, then the worst case scenario is that 
f.(A) maybe empty, but then the inclusion f,(A) C f,(B) automa- 
tically holds. ] 


b) We show that for any indexed family {A;; 7 € I}, 


iel 


(i) Ify € at i) Ai) , there is an x € |) A; such that f(a) = y. Fur- 
iel wel 


thermore, there is an 7g € J such that x € Aj;, and hence, 


ve 
Consequently, 


wel 
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(ii) If y € U f.(Ai), then there is an ig € I such that y € f,(Ai,) 


wel 


and thus there is an x such that f(~) = y € f, ( U A\) . This yields 
tel 


iP ( U A) > U f.(A)). 


el ie. 


c) (4) Ify € f, ( () A\) , then there is an x € () A; such that f(x) 
tel tel 
= y. Thus, x belongs to each A; and y = f(x) belongs to each 
f.(A;). The latter yields that y € () f.(A;) and hence 
tel 


fe ( a A) Cf) f.(Ai). 


iel tel 
(it) Let y € () f.(A;). Then for each i, y € f,(A;) and hence, for 
iel 
each 7 € J, there is at least one point, say x; € A;, such that f(2;) = 
y. Let Ai = f*({y})N Ai. Now, if () Al #@, then for each 


wel 


fe)=ver(N a) cs(14\) 


tel ier 


implying that (]) f.(A;) C (0 A). However, if (]) A; = 9, 
tel iel tel 
there is no x€f} A; such that f(x)=y and _ thus 
tel 


(|) fel As) € fe (n Ai). Exactly it means that for at least one y € 
iel iel 


() f.(Aj), all of its inverse images under f are scattered over A;'s, 
iel 
with not a single point common for all. 


As far as a counterexample, we can take [R,R,, f(x) = x7]. Let 
A=[-2, —1] and B = [1,2]. Obviously, these two sets are dis- 
joint, but not their images under f which are identically [1, 4]. 
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d) Let f = 194, A = [0,3] and B = (1,5). Then, A\.B = (0, 1] and 


thus 1j9,4)((0, 1)) = il}, while Vio(A)\10,4)(B) — {1}\{0, 1} =@ 
implying that the difference under this f is not preserved. O 


3.9 Let {A,} be a sequence of subsets of ©. Show that the 
function lim1 A, is the indicator function of the set lim A,, (1e., 
1. x , ) and that the function lim 1,4, is the indicator function 


a) m 
n=lm=n 


of the set lim A, (i.e., na 


Solution. (i) We need to show that lim14, = Ijq4,. Recall that 


mA, =() U Ar. 


n=1 k=n 
Ifx € limA,, then lima, (v) = 1 and 
xe UAp,c © UAk,-.-. (P3.9) 
k=1 k=2 
Therefore, there is an n, such that x € A,,. From (P3.9) it follows 


[o.@) 
that x € (J Ax. Hence, there is an np > nm, such that x € A,,. 
k=n,+1 


Continuing with this process, we arrive at infinite subsequence 

{A,,} such that x belongs to all of these sets. Consequently, 

14, (x) =1,7=1,2,.... As far as the rest of Aj, they are zeros or 
J 


ones. Hence lim14, = 1. 
Now, let x ¢limA,. Then, lima, (2) =0 and x ¢U Ax, x ¢ 
k=l 


J Ax,... implying that « ¢ A, for all n = 1,2,.... So, 14 (2) =0 
k=2 
for all n and thus we have lim1,, (2) = 0. 


(ii) We show that lim14, = lim a,. Let 
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LE lim A, = U () Ar. 
n=1 k=n 


Then lim 4, (a) = 1. Furthermore, there is at least one index ng such 


that point x € (| A; which implies that 


k=no 


Va, (@) = 1a, (@) = = 1. 


nQ 


Consequently, the whole tail of the sequence {1,4,(x)} equals 1 and 
thus lim 14,(“) = 1. 


Ifx ¢ lim A, = U () Ax, then lim 4,(x) = 0. On the other hand, 
n=l k=n 
by De Morgan's law, x € (] LJ Af and hence, by part (7), La, (x) 
n=l k=n 
=0, j=1,2,..., that is, there is a subsequence of {1,, (x)} 
convergent to zero. Hence lim 1,4 (x) = 0. O 


3.12 Let [X,Y, f] be a consistent map. Denote F' = f, o f*. Then, 
the map [P(Y), P(Y), F] is such that F’ = I, (J is the identity 
map) if and only if [X, Y, f] is surjective. 


Solution. (1) Let f be surjective. Then, for each y € Y, 


O# f'({y}) =ACX such that f.(A) = {y}. 


This shows that F{y} = f.o f*({y}) = L{y} = {y}. [Otherwise, 
if {y} is a proper subset of f,(A), then there is at least one x € A 
such that f(x) = yand f(x) =z y.] 


(it) Let BC Y. Then, B=) {y} and because the inverse pre- 
yeEB 


serves all set operations, we have 


f(B) =U fF Cyt)- 


yeEB 


CHAPTER 7. SOLUTIONS TO SELECTED PROBLEMS 547 


Now in as much as /, preserves unions, 


FCP (BY) =U FP Cyt) 


yEB 
which by (7) equals ) {y} = B. This completes the proof that 


yeB 


7.0 f° =1,.if F is onto, 


(iii) Suppose that f is not surjective. Then there is a subset B of 
Y such that B = B, + By and B, C Ry, and By C Y\R;. Now, 


fo f(B) = f.(f(B1) +8) = Bi FB 
by (iz) because B; C Ry. O 


4.2 Classify the following binary relations. 
a) Let 2. be a nonempty set. Define the relation (P(Q), C ). 
b) Let 2 = R?\(z, 0). Define R: (a, b)R(c,d) = ad = be. 


Solution. 
a) Partial order. 


b) The collection of all points of Q involved in each relation are 
those on each individual line through the origin (excluding the origin 
itself). This argument is based on similarity of right triangles with 
vertices on these lines, on the line-by-line basis. Thus, all points 
(excluding the origin) in each individual line are related and only 
they. Thereby, R partitions (2 into disjoint lines through the origin 
and R is an equivalence relation. O 


4.3. The following theorem is a statement of the principle of math- 
ematical induction. 


Let S(n) be a statement which is true or false, for n= 
1,2,... . Let S(1) be true and suppose whenever S(n) is true 
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implies that S(n +1) is true, n=1,2,... . Then S(n) is true 
for all n. 


Prove it. (Hint: Use the well-ordering principle.) 


4.6 Let 1° denote the space of all summable real-valued sequences. 
In other words, a sequence [N, R, f], in the form of {a,,}, is 
summable, if the series ea converges. We define the 
partial order = onl’ as f < g if and only if f(n) < g(n) for 
alln € N. Is (1°, < ) a lattice? 


Solution. No, it is not. Take f,g € 1° such that f(n) = (—1)"+ and 


g(n) = (- Oe Then, either of these sequences is summable, 
because their associated series are Leibnitz. However, 


iVe= hg} 


is clearly not summable. O 


4.8 Is the set of all continuous real-valued functions a lattice? 


Solution. From (f V 9) = $(f +9+|f — gl) and (f Ag) =3(f +9 
— | f—g|), we see that (f V g) and (f A g) are continuous. Thus, 
the set of all continuous functions is a lattice. O 


4.10 Prove Theorem 4.4: Let [X,Y , f] be a consistent function with 
E; being its equivalence kernel. Then there is a unique 
function |X|Ez, Y, f| such that f = f 0 7p,. 


Solution. Let 21,22 € X. An element 2; € [ao]g, if and only if 
rE fxg [1e., if and only if 7p,(v1) = 7E,(X2)]. Therefore, m:,(x1) is 
the only equivalence class that contains 2;. Consequently, we 
showed that each element of X belongs to exactly one equivalence 
class for Er, even if Ey is merely an arbitrary equivalence relation on 
X. Now, because x;Efx implies f(x) = f(x%2) [see Example 4.2 
(vi22)], the function f assigns all elements x of an equivalence class 
7™,(x) to a single element y of Y. Therefore, f can be defined as a 
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map assigning the entire class [z]:, = T,(x) to a single element 
y = f(x) for all x € 7z,(z). The correspondence TE (2) y is 
therefore a function f: X|Ey — Y such that f = f o 7,,. Obviously, 
such a function is unique. O 


4.12 Prove part (b) of Corollary 4.5: Let [X,Y , f] be a function 
and let Ef denote its equivalence kernel. Then if f is 
surjective, the reducer f is bijective. 


Solution. By the assumption, f,.(X) = Y implying that for each 
y € Y, there is an x € X such that f(x) = y. Thus, for eachy € Y, 
there is a nonempty class [z]p, and consequently, f.(X|E;) =Y, 


that is, f is surjective. From part (a) of Corollary 4.5, LX|E;, Y, f] 
injective, which proves the assertion. O 


5.1 For subsets A C Y; and B,C’ C Y9, prove that 
(i) Laxp(z,y) = 1a(z)1 p(y), Ve € A,y € B. 
(is) (Ax BY =(A° x Yo) U(% x B®). 


Show the validity of the distributive law with respect to cross 
product and union, intersection, and difference: 


(iit) Ax (BUC) =(Ax B)U(AxC). 
(iv) Ax (BNC)=(Ax B)N(AxC). 
(v) Ax (B\C)=(Ax B)\\(AxC). 


Solution. 


(i) Show that the left-hand side and right-hand side are simulta- 
neously equal to 1 or O dependent on whether 
(x,y) € Ax B. 


(it) (A x B)® = [wy (A) 1 73(B)J° = m7(A*) U 03 (B*) 


= (AC x ¥3) U (Yi x BY). 
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For problems (iii-v) we use the pick-a-point process or inverse 
projections: 
(iii) (a,w) € Ax (BUC) S (aE ADA(WE BUC) 

& (aE A)A({we BY V {we C}) 

= ({a Ee ASA {we B}) V (fa € ASA {w € CH) 

= ((a,w) € Ax B)V ((a,w) € AX C) 

(4, wie (Ax B)U(AxC). 


Alternatively, 


[m{(A) 9 13(B)] U [mj (A) 9 715(C)] = (A x B)U(A x C). 


(iv) Similar to (iz) if V is replaced with / : 


o— 


a,w)€ Ax(BNC) S&S (ae ADAWEBNC) 
(ae A)A({weE BEA {we C}) 

= ({aeE ASA {we BY) A (fa € ASA {w € CH) 
= ((a,w) € Ax B)A ((a,w) € Ax C) 

= (a,w) € (Ax B)N (A x C). 


(v) Use the equation B\C = BNC“ and then apply (7iz): 
(a,w)€ Ax (B\C) S&S (aE ADAWE BNC) 

E A)A({weE BEA {w € CF) 

area eer ({a € APA {we C*F) 


ea ((a,w) ¢ Ax C) 
w) € (Ax B)\\(AxC). 


Here is why. Formally, 
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(aw) AxCS(agAAwEC)V(AaEAAWEC). 


But the combination of (a ¢ AAw€C) with (a,w) € Ax B is 
impossible. O 


5.3 _ Is it true that 
(A, M Ag) x (By 9 Bo) = (Ai x Bo) N (Ae x Bi)? 
Solution. Yes it is true. It can be easily shown that 
(A, x By) (Ag x Bo) = (Ar x Bo) N (Ag x By). O 


5.4 On the other hand, using the distributive law in Problem 5.1 
(iti) it appears that 


(A, Ag) x (By Ba) 
= (A; x Bi) M (Az x By) 
M (A; x By) NM (Ag x Bj). 
Is this correct? 
Solution. Yes, it is correct because 
(A, Ag) x (By Bo) = (Ay x Bo) N (Ae x By), 
due to Problem 5.3. O 
5.6 Prove that 
G@) AxUB=U(A KB). 


ie. el 


(ii) LU Al x B=U (A; x B). 


ovk ie. 
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ie. el 


Solution. (ii) [U Aj x B=r* ( U Ai) A r(B) 


ie. ie. 


iel iel 
ss U (A; x B) 
iel 
(iti) Follows by the distributive law and induction. O 


5.9 Let A= <A, x Ao and B= B, x By be subsets of Y; x Yo 
such that A;,B; C Y;,i = 1,2. Prove the validity of the de- 
composition 


A\B = [(A1\B1) x Ay] + [(Ai a) B,) x (Ao\ Be))]. 


In other words, A\B can be partitioned as two disjoint “rect- 
angles: (A;\By) x A> and (A, M By) x (A2\Be). 


Show the validity of yet another decomposition of A\B : 
A\B = [Ai x (A2\Bz)] + [(A1\Bi) x (A2 Ba)). 
Solution. Notice that from Problem 5.2(ii) we have 
A\B = (A, x A2)\(Bi x Bo) 
= [(A1\B1) x Ag] U[A1 x (A2\Ba))- (P5.9) 


In (P5.9), the underlying sets are overlapping. However, we utilize 
(P5.9) to get the expression 


A\B = [(A1\B1) x Ay] + [(Ai a B,) x (Ao\ Be))]. (P5.9a) 
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Clearly, because of (P5.9), 
[(A1\Bi) x Ag] + [(A1M By) x (Ag\B2)] C A\B. 


Next, the sets [(A;\B1) x Ag] and [(A; N B,) x (Ap\Be)] are dis- 
joint, because so are A;\ By, and A; N By. 


We need to prove the inverse inclusion. If w € [(A;\B,) x Ag], then 
this is because the latter is shared by both representations in (P5.9) 
and (P5.9a). So, it follows that w also belongs to the right-hand side 
of (P5.9a). Therefore, assume that w = (#,y) € Ai x (A2\Be) [the 
second set in (P5.9)]. Then, « € A; and y € A»\Bz. If, in addition, x 
also belongs to By, then x € A;M B, and thus, w belongs to the 
second term on the right of (P5.9a): (A:M By) x (A2\B2). 
Otherwise, if « ¢ B,, then x € A,;\B, and thus w belongs to the first 
term on the right in (P5.9a). Either way, w is an element of the right- 
hand side of (P5.9a) showing the validity of the inverse inclusion. 


Therefore, from (P5.9a) it follows that A\B is the union of two 
disjoint rectangles. 


The other decomposition of A\B, 


A\B = [Ai x (A2\B2)] + [(Ai\B1) x (Ao B2)] 


can be shown similarly or by interchanging the roles of A;'s and B;'s. 
O 


5.10 Let {A, C Y,; « © X} and {Z, C Y,;y € X} be two family 
of subsets. Show that 


» (m4)o (1%) = aoe 


vex yEex rEX 


» (a.)u (Ie) 2 Manny 


rex yEex rex 
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Solution. 
» (m4) (qa) = [qe] [qse 


(because the inverse of any map preserves the intersection) 


rEX 


=/()7(4,08,) = [| (4e8;). 
rEX rEX 
b) Follows from a) and AN BC AUB. O 


5.12 Let {A;:i¢€ I} and {B;: 7 € J} be two indexed families of 
subsets of Y; and Y2, respectively. Show that 


[nal x | 9B] = 0 AA BD. 
iel jed jel jeJ 
Solution. 
In Aj x | () Bj 
ie. jet 
=|) N73} (1B; 
ie jet 
= (\ (Ai) N | 1) 73(B;) 
ie. jet 
= 1) NM (mi (Ai) 9 75(Bj)) = M NM (Ai x Bj). O 
iel jeJ iel je 


6.3 (a) Let A be an uncountable set and let B C A be countable. 
Show that A\B is uncountable. (b) If |A| = c¢ and |B| < No, 
show that |A\B] = c. 


Solution. 
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(a) If A\B were at most countable, then A = B + (A\B) is at most 
countable, which is a contradiction. 


(b) We have 
Nx A\BXA. 
By a), A\B is uncountable and thus we have: 


No < |A\B| <e. 


Therefore |A\B| = c, which follows from the continuum hypothesis 
[Remark 6.5(iz)]. O 


6.6 Show that 2%° = ¢. 


Solution. Define f:R — P(Q) (denoting the latter for notational 
convenience by 2%) as f(z) ={qEQ:q<-x}, cE€R, and Q 
denotes the set of all rational numbers. 


a) We show that f is injective. Let a < b € R. Then there isag € Q 
such that a < q < b. By the above definition, gq € f(b) and q ¢ f(a). 
Therefore, f(a) 4 f(b). The same holds if b < a. Consequently, 
R x P(Q). 


b) Let J(N) denote the set of all indicator functions on N. As we 
know, I(N) = 2N. Let F: J(N) — [0,1] such that 


F(14) = 0.140)14(@)--- 


If AF B, then 14 #1, and thus F(1,4) ~ F(18), which implies 
that F’ is injective and then 


I(N) = 2% = [0, 1). 


But 2% ~ 22 and [0,1] +R (see Problem 6.5). Hence 2N = R, 
whereas from a), R~= 2%. By the Schréder-Bernstein theorem 
(Problem 6.2), 28 = R or 2% = €. O 
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6.7 Let [X,Y, f] be a surjective map. Show that there is a subset 
of X equipotent with Y. 


Solution. By the axiom of choice, for each y € Y, there is a point 
ty € f*({y}). Let X’ =) {x,}. Then, LX’, Y, Resx:f] is bijective 
yey 


and thus X’ = Y. O 


6.9 We call an algebraic number any root of a polynomial with 
integer coefficients. What is the cardinal number of all 
algebraic numbers? 


Solution. Let P,, be the set of all n-degree polynomials with integer 
coefficients. Then, clearly, P,, ~ Z"*! (n+ 1 times the Cartesian 
product of all integer numbers). Each such polynomial has at most n 
distinct zeros. If S(P,,) is the set of all algebraic numbers, then it is 
~< -related as follows. 


N =< S(P,) XN x Z™'1 BN, 


where NV = {0,1,...,n}. By the Schréder-Bernstein theorem, S(P,,) 
~~ N. If S(P) 1s the set of all algebraic numbers, then 


SP) = USP) 


and the latter has the same cardinal number as N, according to 
Example 6.3(2v). O 


6.10 Prove the statement. Every infinite set contains a countable 
subset. 


Solution. Let A be an infinite set. Denote f(1) = a; € A, f(2) = 
az € A\{a;},... . We can use Problem 6.7 each time to warrant the 
availability of elements in A\{a,...,a,}. Therefore, [N, A, f] is 
injective. Consequently, N x A and N = f(N) C A. O 
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6.14 Let A= {B,: ax € X} bea countable family (i.e., X ~ N) of 
countable sets. Let 


B= Big hotest Be, WEN A= [hii ea} CO} 


denote the set of all finite Cartesian products from 2. Show 
that B is countable. 


Solution. If n € N is chosen, then A = {2,...,%} C X is selected, 
along with B,, x --- x B,,, then finally a point b, = {b,,,..., bz, } 
(or rather an n-element set) from set B;, x --- x Bz, is picked out. 


In general, all n-element sets can be picked out arbitrarily from 
B;'s, but according to the above Cartesian order, each one of them 
has at most one element from any B;. 


Now, let & be the set of all finite subsets of X. Then, picking out 
a set from &, such as {21,...,2n}, we then form the Cartesian 
product such as B,, x --- x B,, and then select all elements b,, from 
the set B,, x --- x B,, to include them in B. 


Now, if we organize elements of 2 in an infinite matrix 
(assuming for a moment that X = N), with B,'s being its columns, 
then by Example 6.3(iv), this matrix is enumerable, that is, the 
matrix is countable. The set of all of its finite subsets, according to 
Problem 6.13, must be countable. Let 2(,, be the set of all subsets of 
this matrix having n elements and let %,, be the set of all n element 
sets selected in accordance with the above specification, such as set 
b,,. Then clearly, 8, C 2, because %,, by its nature is restrictive, 
whereas 2, is not. For example, 2(,, may contain some subsets with 
all elements chosen from one and the same set B,. Consequently, 


B= U8, CUA SN. Oo 


n=1 n=1 


7.1 Show that each monoid has exactly one identity. 


Solution. Let (G, *) be a monoid with two identities, J, and I. Then, 
I, * Ig = [, and In * I, = Ig. Because * is commutative with respect 
to an identity, the above two equations then yield that J; = I. O 
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7.3 An operation « is called reducible if x * y = x * z implies that 
y = z for all x, y, z. Show that if (G,*) is a group, then * is 
reducible. In particular, show that for each x € G, its inverse is 
unique. 

Solution. From «* y = x* z, it follows that c-!*r*y =a !xax 

z and thus y = z. (Note that y = z implies x* y = x * z, for all z, 

because in x* y, y 1s replaced by its equal element z. The converse 

is not true, unless G is a group and thus x has its inverse. For 
instance, x - 0 = y- 0 does not imply that 2 = y.) The above shows 
that each inverse in a group is unique, if in 7 * y = x * z we replace 

y and z with x~! and (27!)y. O 


7.5 Let [G,G, f] be an isomorphism. Show that [G,G, f~!] is also 
an isomorphism. 


Solution. We only need to show that f~! is a homomorphism from G 
into G. Suppose there are 7,7 € G such that 


f° @«N A LI @*f-@. (P7.5) 


Then, applying f to the left- and right-hand sides of (P7.5) we get 
that T+ A +7. O 
7.7 Let [G,G, f] be an isomorphism. Find Kerf. 


Solution. From Problem 7.6, Kerf = J, because f is bijective. i 


7.9 Let G be the set of all 2 x 2 real matrices whose determinants 
equal 1. 


a) Show that (G, - ) (with the usual matrix multiplication) is 
a group. 
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b) Let B be any 2 x 2 nonsingular matrix. Define the map 
[G,G, f] such that f(A) = B-'AB. Show that [G,G, f] 
is an automorphism. 


Solution. a) - is associative. Then, the 2 x 2 identity matrix J 
belongs to G, because det! = 1. For each ACG, A exists. 
Therefore, G is a group. 


b) [G, G, f] is a homomorphism. Indeed, by the definition of f, 
f(AC) = BACB = (B-!AB)(B-CB) = f(A) f(B). 
Furthermore, 
det(B-! AB) = det(B-)det(B)det(A) = 1. 


Thus, B-'AB € G and f is an endomorphism. Now, 


f-\(A) = (B-!AB)“! = Bo1A-B. 

Because A~! € G and thus obviously, det[ f~!(A)] equals 1, f~'(A) 

€ G. Consequently, f is bijective and [G,G, f] is an isomorphism. 

Finally, it all makes [G, G, f] an automorphism. 

7.11 Show that l? (defined in Example 7.11 (v)) is a vector space 
over C; specifically, show that x,y €l? > x+y €I?. [Hint: 
Apply the inequality in Problem 7.10 in the form |a, + yn|’ < 
2°-*(|en|? + lynl?).] 

Solution. First observe that for all p > 1, from Problem 7.10, 

Lee = Uy|” S Es ar lynl |? S oP laa + \yn|”). 

Thus, 


Dncilen + Yl? < POY (lanl? + yal”) 
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= Py teal a done lYnl”) < 0O, 


if x = {rn}, y= {yn} € I”. Hence, {an + yn} € 1”. Thata-a € IP 
(where a € C) 1s trivial. im) 


7.14 Let S bea subset of C. Argue for what cases S' is a subspace of 
C over R: 


a) S is a closed unit disc centered at zero; that is, S = 
{zeEC: |z| <1}. 


b) S = {z € C: {|Re(z)| < 1} x {|Im(z)| < 1}}. 
c) S = {z € C: {Im(z) = 0} x {|Im(z)| < 1}}. 


d) S = {z €C: Im(z) > 0 and Re(z) > O} U{zeEC: 
Im(z) < 0 and Re(z) < 0}. 


Solution. S's in a - d) are not closed relative to the addition and in a - 
c) not closed relative to the multiplication of vectors by scalars. 
Thus, none of S' in a- d) is a subspace. O 


7.16 Let (X,R, +,-, x) bea Riesz space and let A, B C X such 
that infA, infB, inf(A+ B) € X. Prove the following equa- 
tions. 


(i) inf(A+ B) = infA + infB. 
Solution. — inf(A + B) = sup( — A— B) 
= sup( — A) + sup( — B) = — infA — infB. O 
(ii) Forany a > 0, ainfA = inf(a@A). 
Solution. ainfA = a( — 1)sup( — A) 
= — sup(( — 1)aA) = inf(a@A). a 
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(iit) For any a < 0, the following hold: asupA = inf(a@A) 
and ainfA = sup(aA). 


Solution. Similar to (i7). O 


7.18 Let (X,R, +,-, xX) be a Riesz space. Prove the following 
inequalities. 


(i) Wesay that A<uifu€U(A). Let AX uand B Xu, 
then sup(A U B) X u. 


Solution. Obviously, AU B X wand thus sup(A U B) ~ u. O 
(i) Ifa zandy < z, then z Vy = z. 


Solution. By (1), {x,y} xX z, because z is an upper bound for sets 
{x} and {y}, we have x V y X z. O 


(ait) Ifa < zand —ax < z, then |z| x z. 
Solution. Follows directly from (71) and the definition of |z|. CO 
(iv) The inequality —z < x X z is equivalent to |x| X z. 
Solution. From—2x x z we have by Axiom 2, 
=¢4 (2 =2) 5 ee le = 2) oS 2 Se, 


Thus, x < z and —x = z is equivalent to —z < x = z. On the other 
hand, from |x| = x V (—2x) ~ z it follows that 


x x |x|,-x < |x|, and « x z and —z Xz, 
which yields |x| < z by (iz#). O 
(v) The triangle inequality: |x + y| x |x| + ly]. 


Solution. First notice that if a x c and b<d, thena+b~xc+d. 
The proof is simple) a+bxc+bxc+b+(d—b). Now, 
because a X ja] > r+ y X |z|+|y| and —2-—y X |x| + yl, the 
statement then follows from (v7). O 
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8.1 Prove the statement in Proposition 8.8(7) that the subset 
Kerf C V is a linear manifold. 


Solution. First, by Problem 7.6, Oy € Kerf. Secondly, if x,y € 
Kerf, f(x) = f(y) = Ow, which yields 


flax + By) =af(x) + Bf(y) = abw + BOw = Ow 
[by 7.6 (iv) properties of vector spaces] 


implying that ax + Gy € Kerf (by the definition of Kerf holding for 
homeomorphisms). Notice that in the proof it was essential that both 
VY and W were vector spaces over the same field. O 


8.3. Let X bea finite-dimensional vector space and let Y C X bea 
linear manifold. Show that dim Y < dim_X. 


Solution. Let A be a Hamel basis for Y. Thus, A is a linearly in- 
dependent subset of X. By Proposition 8.13, there is a Hamel basis 
H for X such that A C H. Thus card(A) < card(#). O 


8.6 Let X be a vector space and let Y, Z C X be two linear mani- 
folds. Prove the statement. For each x € Y + Z, there is a 
unique decomposition of x by vectors y © Y and z € Z such 
that x=y+z if and only if YNZ ={6}. (Two such 
manifolds are said to be /inearly disjoint.) 


Solution. (i) Suppose Y and Z are linearly disjoint. Let x € Y + Z 
such thatz =y+z=y'+2' withy,y’ € Y and z, 2’ € Z. Then, 


Ysy-y=2/-zEZ 


implying that y — y’ and z’ — z belong to either Y and Z. But due to 
our assumption, Y and Z are linearly disjoint, thus we have y = y 
and z = 2’ proving that indeed x = y + z is a unique decomposition 
of x. 
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(ii) Suppose each x € Y + Z admits a unique decomposition by 
vectors from Y and Z but suppose that Y and Z are not linearly 
disjoint. Then there is a vector w € Y MZ such that w 4 0. By the 
assumption, X >x=yt+z2=y-—aw+z-—aw, good for all 
aéF. But y—aw€éY and z—aw € Z meaning that the above 
decomposition is not unique, which is a contradiction. O 


8.7. Let X be a finite-dimensional vector space and Y;, Yo C X be 
two linearly disjoint manifolds. (See Problem 8.6.) Then, 
Y, + Y is called a direct sum of Y; and Y2 and it is denoted by 
Y; @ Yo. Prove that if X = Y; © Yo, then dim X = dim Y; + 
dim Y5. 


Solution. By Problem 8.3, dimY, < dim X and dim Y2 < dim X. 
Thus there are Hamel bases, Hy = {a,...,a,} for Y; and Hy = 
{b1,.--, 0m} for Y2. Denote H = {a,..., a4, 61,..., bm}. First, we 
show that H is linearly independent. To render it we let 


k m 


Because 9 € X and yo 04a; = :y, € Y, and 190; = :yeE 
Y2 we have 0 = y; + y2 being a decomposition of # as an element of 
X by two elements of Y; and Yo, respectively. Due to the assumption 
that Y; 6 Y2 is a direct sum. Thus, by Problem 8.6, this decom- 
position is unique because 6 = 6 + @ is a legitimate representation of 
0. We conclude that eer = 1 9b; = 0. Then, because H, 
and H» are Hamel bases, we have a; = 3; = 0 for all relevant 2's and 
j's. Consequently, H is linearly independent. 


Secondly, we show that span H = X. Because X = Y, + ¥2, for 
each x € X,x =y, + yo holds, where y, = So 04a; and yo = 
>, 4;b;. These imply that x = yo aia; + yo. 5jb; and thus 
serves as an evidence that span H = X. The latter is equivalent to 
dim X =k+m =dimY, + dim Y9. i 
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Chapter 2. Analysis of Metric Spaces 


1.2 Let X =Randd(z, y) = \/|x — y|. Is (X, d) a metric space? 


Solution. d is a metric. The first two properties are evident. To show 
the triangle inequality, we use 


Vial + [ol < Val + v/lel 


and 
d(z,y)= J/|z-yl=Jlz-z+2z-yl 
< Jla-z/+lz-y! 
identifying x — z asa and z — yas b. O 


1.4 Let d bea metric on X. Define 


d(x,y 
pa, y) _ at . 
Show that p isa metric on X. 


Solution. We need to show just the triangle inequality because the 
two other properties of p as a metric are obvious. Let 


d; = d(x,y), dz = d(a, z), and ds = d(z, y). 
Because d; < dz + d3, we have that 
dy <dg+ dz < dy + 2dod3 + ds, 


which implies that 


_ ad _ _ds dy ds 
ay|1 Td; ~ T+ds| S Tray + Tray 


The latter, after division by 1 + d;, is equivalent to 
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qs UL dh) ee a (1 + dh) ey 
which yields the desired triangle inequality. O 


1.6 (a) Prove Hdlder's inequality for finite sums: for conjugate 
exponents p > 1 and q > 1 such that : - Sl Oi ein ge 
O,. and Diycix5 Oy, 0); 


n n 1/p n 1/q 
aid; < ba bul : 
i=l i=l 


i=1 
Hint: Apply Problem 1.5 to x = a;/A and y = b;/B, where 
1/q 


n 1/p n 
A= [Sat] and B= bul 
7=1 i=l 


(b) Generalize Hélder's inequality for infinite sums. 


Solution. a) With the above notation of A and B, assume that 
AB > 0. Otherwise, the inequality is trivial. Now apply Problem 
1.5, with x = a;/A and y = 6;/B to yield 


p P 
a; 0; a; bj 


A B — pAp ae qBP* 


After summation over 7 we have 
Diab < (4+ 1)AB= 4B 


as the Hélder's inequality. 


b) The inequality holds when we increase the right-hand side as 


n [ere] 1/p lore) 1/q 
diaidi < ba ba 
i=1 i=1 


i=l 
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Finally, it still holds when passing to the limit n — oo on the left: 


1.9 


1/q 


eo) 0° V/PT ow 
i=1 i=1 i=1 


In Problem 1.8 we defined the Euclidean metric on R” by 
equation (P1.8). This metric can be regarded as 


dp(x,y) _ {2 dy (ty, We) (P1.9) 


c= (21, sang ny y= Gigsse Wa) (P1.9a) 


where d;,(xz, yx) 1s the one-dimensional Euclidean metric on 
the kth coordinate axis (Ath factor space). We can extend this 
notion and define a metric on the n-times Cartesian product set 
Y =Y,x.---x Y, by formulas (P1.9-P1.9a). This problem 
asserts that such d, is indeed a metric on Y. We call this met- 
ric the product metric and the corresponding metric space 
(Y,dp) the product space. In notation, x {(Y;,, dp): k = 1, 


aad Tie 


Prove the statement: Let (Y;, dy), k=1,..., , bea collec- 
tion of metric spaces and let Y be the Cartesian product of 
Y1,..., Yn. Then the function dp on Y x Y defined by (P1.9- 
P1.9a) is a metric on Y. 


Solution. It remains to prove the triangle inequality. Let 


Then, 


ak = dk (Xk, Yk) and by = dil uk, Zk), i= Pere Nn. 


dp(z,y) + dey, 2) = Jet Je 


(by Minkowski's inequality) 
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> Jf >rpai Ge + ye)? = (Xa (dior, Yr) + dk(YR, 2)” 


(because d;, is also a metric on Y;) 


> J Dialde(ae, zp)? = dp(e,Z). im 


2.1. Show that0A = 0A° = ANAS. 


Solution. By the definition, a point x € (X,d) is called a boundary 
point of A if every open ball at x contains points from A and from 
A°. On the other hand, a point « € X is called a closure point of 
A C X if every open ball centered at x contains at least one element 
of A. O 


2.3. Show that [A‘]° C A. 


Solution. We show the equivalent inclusion that (A)* C A®. Let 
x € (A)°. Then, x ¢ A and x ¢ A, which implies that x € A° and 
thus, x € A°. J 


2.5 Ifx€ OA, must x be an accumulation point? 


Solution. No; see for instance Example 2.12 (v) where 2 is such a 
point: 


(v) Let A =(0,1)U {2}. Then, A = (0,1), A= [0,1JU {2}, A’ 
= [0,1], 0A ={0,1,2} (since A° = (—co,0) U [1,2) U (2, 00), 
Af = (—00, 0] U[1,00), and AN A¢ = {0, 1, 2}). O 


2.7 Let AC (X,d), where X is an infinite set. Show that if x is 
an accumulation point of A, then every open set containing x 
contains infinitely many points of A. 


Solution. Suppose there is an open set O containing x and just 
finitely many points x1,...,2, of A, none of which is x. Clearly, 
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é: = min{d(x, 71),...,d(x, 2,)} > 0. Then B(x, €) contains at most 
one point of A, which is x if x € A. Consequently, B(x,¢)N 
(A\{x}) =@, that is, x is not an accumulation point of A and this 
is a contradiction. L 


2.8 Give an example of a continuum closed set that does not have 
any accumulation point. 


Solution. Consider the set A = (a,b) (a < b) in the discrete metric 
space with the carrier R. Then, A does not have any accumulation 
point, because for each x € A, B(x, $) = {x}. In other words, for 
each x € A, there is an open ball centered at x containing only «x. 
(Recall that A is closed.) O 


2.11 Prove that A = A +0A. 


Solution. a) We prove that AN A® = O. Let x € A. Then, there is 
an r such that B(z,r) CA > B(a,r)N AS =O => w¢ A’. 


b) We show that Au a= X (the carrier), which would be 
equivalent to A = (Ac)". Obviously, 


a¢ Ae & Ir: Ba,nNA =O 6 Bie,r)CA & EA. 
c) Prove the statement. 
A=AnX=An(A+A) 


—~A+(ANA)=A+ OA. oO 


2.12 Show that AUB=AUB. 


Solution. A closure point of A is obviously a closure point of AU B. 
Thus A C AU Bandsois B C AUB. Therefore, AUBC AUB 
and we need to show the inverse inclusion. Suppose there is a point 
xz € AUB that belongs neither to A nor B. The latter is equivalent 
to the existence of balls B(xz,7r,) and B(x, rz) that do not meet A 
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and B, respectively. Thus the ball B(x,r), where r = min{r1,r2} 
does not meet A U B. But this is absurd, in as much as ~ is a closure 
point of AU B. O 


2.15 Prove Corollary 2.14.4 set A is closed if and only if A = A. 


Solution. Let A be closed. Then A is a closed superset containing A. 
By Proposition 2.11, A is the smallest closed set containing A. 
Thus, ACA => A=A. Now, let A= A. Because A is closed, 


by Proposition 2.11, A is closed. a 


2.17 Define S(z,r) = {y € X : d(x,y) =r} and call it a sphere 

centered at x with radius r(r > 0). In Euclidean space 
(R",d-), obviously, S(z,r) = OB(z,r). Is this true for metric 
spaces? If it is not, then is there at least a relationship between 
S(a,r) and OB(x,r) common for all metric spaces? Are there 
any exceptional cases? Explain. 


Solution. (1) Due to Remark 2.15, 
B(x,r) € B(a,r) € C(#,r). 
The inequalities will continue to hold if we intersect each of the three 


sets above with one and the same set. Therefore, the inequalities will 
hold if we subtract a set yielding 


O= Ba,7)\\Bia rr) 6 Bar) \ Be) 
CCla,7)\ BG, 1); (P2.17) 
(it) Now, obviously, S(x,r) = C(a,r)\B(a,r). (P2.17a) 


(iti) On the other hand, by Problem 2.1, 


OBa.7)= Bary Bar) =Ba rn barny 


= Bier), (P2.17b) 
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because (B(x, 1))° is closed. From (P2.17-P2.17b) we thus have 
Gb(a7) GC S(@,7) 

holding for all metric spaces. 

(iv) However, 0B(x,r) # S(x,r), as we can see it from the 

example of the discrete metric, in which 0B(xz,1r) is a proper subset 


of S(a,r). Indeed, from Example 2.2 (iv), Remark 2.15, and 
(P2.17a), 


_ 0, rl 
whereas by Problem 2.16, 0B(z,1r) = @ for all r. O 


3.2 Prove that a set A C (X, d) is nowhere dense in X if and only 
if the complement of its closure is dense in X. 


Solution. Let B = A. By problem 2.11 part (b) (in its solution), 
B=0 & Be=X. 


An alternative and longer proof. We prove that “A is nowhere 
dense” is equivalent to “(A)° is dense in X.” Let B be a subset of X 
such that Be =X and suppose that B #@. Then, there is an 
2eEB C B. Then, there is a sequence {x,,} C B® such that rz, — 
x. Because x is an interior point of B, there is an open ball 
B(«x,r) C B. Furthermore, there is an r-tail {x;,,...} (of {x,}) 
entirely in B(xz,r), thus in B, not in B°, which is a contradiction. 
Thus, B=0. Conversely, let B ~ © (because the proof would be 
trivial), but B=@. Then, for each x € B and for each r > 0, 
B(x,r)Q B° £@. Consider the monotone decreasing sequence 
B(x,+) of balls from each of which we select a point x, € BY. 
[Such a point can be chosen because of B(x, r) MN B® # @.] Clearly, 
rn — «x. If x € B°, the same statement is trivial. Thus, Bo = X. 
Finally, we take A for B. O 
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3.3 Assuming that (IR, d.) is complete (a known fact from 
calculus) prove that (IR”, d.) is also complete. 


Solution. Let {x")} be a Cauchy sequence in (R”,d.). Denote 


{al} the ith component sequence of {a:‘")}, Because 


de(a"”, a) < d.(x™, 2), 


a 


each component sequence of {x'")} is also Cauchy and thus con- 


vergent in (IR, d.). Let gl”) _, ee and 2 = (ry 05 t,). Then; for 


a 


each e/,/n > 0, there is an N; such that for each k > Nj, 


| ol” — a | <e//n. 


Let N = max{Nj,..., N,}. Then, for each k > N, 


d.(el"), 2) = (TR (al — 2)? <e. o 


3.4 Show that any Cauchy sequence is bounded. 


Solution. Recall that a sequence {x,,} is called bounded if for every 
n, d(%1,%,) <M, where M is a positive real number. Now, if 
{z,} is Cauchy, for every ¢ >0 there an N(e)-tail of {z,}. It 
means that GC (Bins We) Ee if m,n>WN. Let 
D2 = wax d(a,03)58 = yan, NV}. Then, 


d(x1,%,) < d(a1,2y) +d(ay,%n) < Dte, 
for all n > N, and 
d(#1,%,) < D, forall n<N. Oo 


3.7. Show that if {x,,} C (X,d) is a Cauchy sequence and {z,,, } is 
a subsequence convergent to a point a € X, thenz, — a. 
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Solution. d(%n,@) < d(%n,Xn,) + d(an,,a). Now, because {z,} is 
Cauchy, for each ¢ > 0, there is an Nj-tail of {x,,}, and thus of 
{tp}, such that d(%,,%p,) <e¢ for all terms of this tail, and 
d(%n,,a@) <€, for all nz, > No, due to the convergence of {z,, }. 
Therefore, d(%n, a) < 2e, foralln > N = max{N,, No}. O 


3.8 Show that in (R”",d.), B(x,r) = C(z,r), where C(z,r) is a 
closed ball of Remark 2.15. 


Solution. a) Because C'(x,1) is a closed set and because B(x,1r) C 
C(x,r), B(x,r) C C(x,r) (which is also due to Remark 2.15). 


b) Show the inverse inclusion. Let y € C(x,r). Define y, = 4” + 
(1 — +) y. From 


tosm) = (Thal Far Dk 


< (d = +) aa 
it follows that y, € B(«x,r). Furthermore, d-(yn, y) — 0. Thus, by 
Theorem 3.3 (A point x is a closure point of a set A if and only if x 
is a sequential limit point of A.), y € B(x,r). O 


4.1 Show that if A is totally bounded, then A is bounded. Give an 
example, where a bounded set is not totally bounded. 


Solution. (i) Let A be totally bounded. Then, Ve > 0,5 a finite cover 
of A by balls of radius e: (J B(a;,¢) D A. Let x,y € A. Then, x € 
i=1 


B(a;,€) and y € B(«,;,€) implying that 
<e+M+e=2e+M, 


where M: = max{d(xp,z,) : p,r =1,...,m}. Consequently, 
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d(A) = sup{d(z,y):2,y € A} < 2e+M 


making A bounded. 


(iz) Consider (R,d), with d(x,y) = min{|z —y|,1} and A= 
[(0, co). Then, A is bounded, but obviously not totally bounded. O 


4.3. Show that a continuous bounded function on a bounded inter- 
val need not be uniformly continuous. 


Solution. Let f(x) = sint on I = (0, 2]. Clearly, f is continuous on 
I. Let 


— _2 _ =e ul = 
Ln = (Qn+l)n? Un = na? n= Fe ae ‘ 


Then f(x,) = +1 and f(y,) = 0. Therefore, | f(r) — f(yn)| = 1. 
On the other hand, 


1 
Zn — Yn| = aORtHe 


which can be made arbitrarily small. Therefore, for any « < 1, there 
is no 6 good for all x € J required by uniform continuity. 


Another solution. Take a sequence {x,, = +} C I, which is clearly 
Cauchy. However, f(x,) = sin(4') is an oscillating, non - Cauchy 
sequence. Thus f is not Cauchy continuous and by Proposition 4.16 


f is not uniformly continuous. O 


In Problems 4.4 -4.8 it is assumed that f and g are functions from 
(R, d.) to (R, d.). 


45 Let f: A-R be a differentiable function such that its 
derivative f’ is bounded over A, where A is an arbitrary 
(bounded or unbounded) interval. Show that f is uniformly 
continuous on A. 
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Solution. From the mean value theorem, 
f(z) — f(y) = f\(c)(@—y), forac € (az, y). 


Hence, |f(x) — f(y)| < M|x—y|. For an e>0, take 6= 5. 
Then, |x — y| < 6 implies | f(x) — f(y)| <e. O 


4.6 Show that if f and g are uniformly continuous on R and boun- 
ded then fg is uniformly continuous on R too. However, if f 
and g are just uniformly continuous then fg need not be uni- 
formly continuous. 


Solution. (i) Let |f| < My and |g| < Ms and M = max{M,, Mp}. 
Select ¢ > 0 and let 6 > 0 be such that Vz, y € X with |x — y| < 6, 
|f(z) — f(y)| < aig and |9(x) — g(y)| < afz- Then, 


lf(x)g(x) — flya@)| < IF (@)Ila(@) — gy) 
+ lo(y)||F(2) — fy)| < Mag + May = 


(iz) Although the function f(x) =a is uniformly continuous, 
g(x) = 2? = 2- zis not. oO 


4.8 Let f be a continuous function and g a uniformly continuous 
function on a set A such that |f| < |g|. Is f then uniformly 
continuous? 


Solution. No. For instance, if f(x) = sin4 on I = (0, 2] (of Problem 
4.3), then |f| <1. O 


4.10 Show that in R”, Euclidean and supremum metrics are equi- 
valent. 


Solution. The statement follows from two inequalities, 


£,Y) < V/Ndoo(Z,y), doo(t,y) < de(z, y), 
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and Remark 4.9. O 


4.12 Let (X,d) be a metric space and define the function 
p: (X?, X”) =~ Ry as p(x, y) — d(x1, 41) + d(x2, y2), where 
X = (11,42) andy = (y1, y2). (a) Show that p is a metric on 
X?. (b) Define the function f(x) = d(x,,x2). Show that 
f : X? =R, is uniformly continuous on (X?, p). 


Solution. (a) is obvious. 
(d) f(x) — F(y)| = ld(x1, 22) — d(ys, y2)| 
= |d(x1,@2) — d(y1, 22) + d(x2, 41) — d(y1, y2)| 
< |d(ax1, 2) — d(yn, €2)| + |d(w2, m1) — d(C, y2)| 
< (x1, 41) + d(@2, yo) = p(x, y). - 


4.14 Give an example of a continuous function [X,Y, f] from 
(X,d) to (Y,p) such that f is not Cauchy continuous. (Hint: 
See Example 4.15.) 


Solution. The function f(x) = arctanz is convergent to = when 
x — oo. Thus so does the sequence x, = arctan(n) — 5 when 
n — oo. Therefore, {x,,} must also be Cauchy. However, {rtan(,,)} 
= {n} is clearly not Cauchy. O 


a 
2 
cs 
2 


4.16 Prove the following statement. Let (X,d) and (Y,p) be two 
metric spaces and let |X,Y,f| and |X,Y,g] be two 
continuous mappings. The subset A= {x € X: f(x) = g(x)} 
is closed. 


Solution. A is closed if and only if it contains all of its sequential 
limit points. Let x € A and let {x,,} C A such that x, — x. Thus, 
for each n, F (an): = f(%n) — g(@n) = 0. Because F = f — g is 
continuous, F'(xz,) — F(x), because {x,} is convergent. But 
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obviously F'(x,) — Oimplying that f(x) = g(x) and thus xe A 
>ACA. 


Another Solution. We show that A° is open. Let a € A® and 
p(f(a),g(a)) = 3M. Because a € A°, M > 0. Because f and g are 
continuous, for each « > 0, there is a 6(a) > 0 such that 


p(f(@), f(a)) < € and p(g(x), g(a)) <eé 


for all x such that d(x, a) < 6(a). Therefore, 


3M = p(f (a), 9(a)) 
< p( f(x), f(@)) + p(9(2), g(a) + p(f (x), 9(2)) 
< 2 + p(f (x), 9(2)). 


Let now ¢ = M. Then for all x such that d(x,a) < 6(a), we have 
that p( f(x), g(z)) > M, that is, for all x € Ba(a,6(a)), f(x) F 
g(a). Therefore, Ba(a,6(a)) C A® yielding that each point a € A° is 
interior. O 


5.1 Using similar arguments as in Example 5.6, show that the limit 
of any uniformly convergent sequence of continuous bounded 
functions from (X, dp) to (Y,d) is a bounded and continuous 
function. 


Solution. Let {f,,} C F.((X,do),(Y ,d)) be a sequence of continuous 
bounded functions. Suppose that f is the limit of {f,,} in the sense of 
the supremum metric p. Thus, for each ¢« > 0, say « = 1, there is an 
N = N(1) such that p(f,,f)<e¢ for all n> N. In particular, 
fn € Fy, therefore 


Af, 9) < p(f, fv) + e(fn, 9) < 1+ M(fn): = M. 


Hence f € F,. Now we show that f is continuous. Given an x and 
n, for each ¢ > 0, there is a 6, >0 such that for every y with 
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do(x,y) < 62, d(fn(x), frly)) < €. Now, by the triangle inequality 
used twice, 


d( f(x), f(y) < d(f(2), fr(®)) 
+ d(fn(Z), fry) + A fn(y), fr(Y)) 
SOS ida) SP a 


Taking n > N, we have the last expression dominated by 3¢. This 
yields continuity of f. O 


5.3. Let {C,,} be a sequence of closed balls in (R",d.) such that 
each of the balls C,, is centered at a point x) € R” and has 


CO 
radius 4 ,n=1,2,... . Find the intersection (] C,. 


n=1 


love) 
Solution. By Cantor's Theorem 5.4, (]} C,, has exactly one point. It 
n=1 


is easily deduced that x9 is the one. O 


5.6 Let F C F,(X;(Y,p)), where Y is a vector space. Prove that 
F is d,.-bounded if and only if there is a positive constant 
such that for all f € F, d(f,0) < M. 


Solution. Recall [Example 5.6(7)] that F is p-bounded or uniformly 
bounded if diam F = sup{dx(f,g) : f,g € F} < oo. 


a) Suppose F is such that for each f € F, d(f,0) < M. Then, for 
all f,g € Ff, 


doo(f,9) < doo(f,0) + doo(g, 0) < 2M. 
Therefore, diam F < 2M. 


b) Suppose that F is d,.-bounded, i.e. diam F < oo. We need to 
show that d.(f,0) < M for all f € F. If F is a finite subset, then 
the proof is trivial. Suppose F is infinite and let sup{d,(f, 0): 
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f € F} =o. Then, there is a sequence {f,} CF such that 
doo(fn, 0) — oo. Because d,, is a metric and thus d.,(g,0) < co 
for any g € F, we have 


doo Fins 9) Zz ldco (fr 0) _ ds.(g, 0)| 7 ©. 


This contradicts the assumption that diam F < oo. O 


5.8 We can make use of the fact that the Euclidean and uniform 
(supremum) metrics are equivalent (see Problem 4.10) to show 
completeness of (R",d.). For n = 1, it is wellknown from 
calculus. Prove completeness of (IR”,d,.) for an arbitrary n. 
(See Problem 3.5.) 


Solution. a) We show that d, and the supremum (uniform) metric are 


: . \) de : 
equivalent on R”. Let «‘*) —S a. Thus, for each ¢ > 0, there is an 
N such that for all k > N, 


d(c, 2) = (OX a — a? <e. 
Now, obviously, 


doo(x), x) = max{ ey — Xi 


.s= tect) < d(x), x), 


which yields that d(x"), 7) < € as well. Conversely, let 


Then, because d(x"), x) < \/ndx(x™, x), we easily arrive at 


») de 
a) Sg, 


b) Now, let {2:“)} be a Cauchy sequence in (R”, d,). Then, {ax'*}, 
for each 1 = 1,...,n, is Cauchy in (R, d.) [because de(as*? a”) < 


be EO’ 
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d-(a), ™)), But then, for each i, {a\*) is convergent, as (R, d.) 
is complete. Let 


gl") . nF 
Thus, 

max{d(x\"), x;); 4 = 1,...,n} — 0, 
LG. 

®) Sy 


Due to the equivalence of Euclidean and supremum metrics (a), 


ak) s x. O 


6.1 Show that if {z,} C (R",d.) with d(a,,0) <3, then {x;} 
has a convergent subsequence. 


Solution. Let A = {x € R": d(x, 0) < 3}. Then A, as a closed ball, 
is closed and bounded, thus compact. By Theorem 6.3, every 
sequence in A must have a convergent subsequence. O 


6.3. Let A,B C (X,d) such that A is compact and B is closed. If 
An B=@Q, show that d(A, B) > 0. 


Solution. First observe that 


= 0, reEB 
d({x}, B) 1S { Ss 0, TE (BY. 


Indeed: 1) If x € B, for eache > 0, B(x,ce)N B 4 @. Thus, there 
is yp € B(a,e) NB. Clearly, d(x, yz) < € implying that 


d({x}, B) < d(x, yp) <e 


= dx}, B)= 90. 
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2) If x €(B), there is an open ball B(z,r) such that 
Bia, r)NB=08 => Vyp eB, d(x,yg)>r. Obviously, the 
converses to each of these statements hold. 

Now, by Problem 6.2, there is an a € A such that d({a}, B) = 
d(A, B). If d(A, B)=0, then d({a},B)=0 and hence a€ B. 
However, because B is assumed to be closed, a € B and then AN B 

# @, which is not the case, by the assumption. Thus, 


d(A, B) > 0. O 


6.4 Let AC (X,d). Show that if A is totally bounded then A is 
also totally bounded. 


Solution. Choose some ¢€ > 0. Let « € A. Then, B(x, §)N AF @. 
Let ye B(w,5)N A. Then d(z,y) < 5. Because A is totally 
bounded, 


ACU Bay, §). 


i=l 
Thus, y € B(x;, 5), for some 7 and 


d(x, xi) < d(x, y)+ dy, vi) < §+§ =e. 


Therefore, x € B(x;,¢). Because A C LJ B(a;,€) and an arbitrary 
i=l 

x € A belongs to one of the balls of the e-cover of A, it follows that 

{B(x1,€), ---, B(an, €)} also covers A. O 


6.6 Show that sequential compactness of a subspace implies its 
completeness. 


Solution. Let {x,} be a Cauchy sequence in A. Then, {2,,} has a 
convergent subsequence {z,,,}. Denote the limit of this subsequence 
by x. Then, for each ¢ > 0, there is an N’ such that for all k > N’, 
d(%p,,,0) < €. On the other hand, because {z,,} is Cauchy, there is 
an N such that for all m,n > N, d(2n,%m) < €. Now, 
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A(Ln, x) = d(Zn, Div) + Ce Gian x) < 2€, 


when n,n; > max{N, N’}, which shows that x is also the limit of 
{Zn}. O 


6.7 Prove Theorem 6.2. Any subset of R" is Lindelfiset in 
(IR", d.). In particular, (IR",d.) is a Lindelf space. 


Solution. (i) Let {O; : i € I} be an open cover of R” and let x € R". 
Then, there is an element O of the cover {O; : i € I} that contains «x. 
By Problem 2.19, there is an open ball B(q,r) C O such that q € 
Q",r EQ, and x € B(q,r). Consequently, we can cover R” by all 
such balls, each associated with x € IR”. Because each such ball has 
rational parameters qg and r, the entire set of all these balls is count- 
able. Let us enumerate these balls as {B(q.,7r,) :k € N}. Obvi- 
ously, each of these balls contains more than one points such as x 
above. 


Now, because B(q,,7,) is contained by some O, say O;,, it follows 
that {O;, : k € N} also covers R", but in this case {O;, : k € N} isa 
countable reduction of the original cover {O; : i € I} proving that 
(R", d.) is Lindel6f. 


(ii) The proof is identical if we replace R” with a subset A C R". O 


6.9 Show that a mapping f : (X,d) — (Y, ) is continuous if and 
only if, for any compact subset KK of X, Resx f is continuous. 


Solution. (i) Let K be the set of all compact subsets of X and f be 
continuous on each K € K. In other words, givena K € K, Resx f 
is continuous. We show that f is continuous on X. Throughout the 
proof, we will be using Heine's Theorem 4.4: A function f: 
(X,d) — (Y,p) is continuous at x € X if and only if for every 
sequence {x,,}, d-convergent to x, its image sequence { f (x,,)} is p- 
convergent to f(x). 
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Let {x,,} be any convergent sequence, say to a point x and let A = 
{@,%1,X,...}. Obviously, A is sequentially compact. Thus, by 
Theorem 6.3, A € K. By the assumption, f is continuous on A. 
Thus, according to Heine, f(y) — f(y) for every convergent 
sequence {y,} C A. In particular, it should work for {z,,}, that is 
f(an) — f(x). In summary, we showed that given an arbitrary 
sequence {x,,}, convergent to a point x, f(x,) — f(x). This, again, 
by Heine, implies that f is continuous on the entire X. 


(it) The converse. Suppose f is continuous on X and let kK € K. 
Then, by Theorem 6.3, K is sequentially compact. In particular, it 
means that if {x,,} is a sequence convergent to a point x and if {x,,} 
C K, then x, — x. (See Problem 3.7.) In an alternative argument, 
by Theorem 6.7, K is closed and bounded. Thus, by Corollary 3.4, 
K contains all of its sequential limits. Thus, if {x,,} is a convergent 
sequence in K, its limit, say x, must belong to kK. Then, by Heine's 
Theorem, because f is continuous on X, f(x,) > f(a). But 
because {x,21,%2,...} CK, it follows that f has the Heine's 
property on K, and because {x,} was an arbitrary convergent 
sequence in Kk, with all said above, f must be continuous on K, i.e., 
Resx f 1s continuous. O 


7.2 Define the space [°° as the set of all bounded sequences x = 
{x1,2,...} CC. Show that I is an NVS with the norm 
defined as ||z|| = sup{|x,|: i = 1,2,...}. 


Solution. We show the triangle inequality. Other properties of the 
supremum norm are obvious. 


xi + yi| S |i] + [ys] < lla] + lly 
Thus, 
Iz + y|| = sup{|zi + yl} < [all + Ilyll.- 


To show linearity, observe that if x,y €1°, then x and y are 
bounded and so is ax + Gy. im 
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7.4 Let F,(Q) be the space of all bounded real-valued functions on 
Q. Show that F, is a vector space. Let ||f||,, = sup{|f(w)|: 
w € Q} be the supremum norm defined in Example 7.7(cit). 
Show that the supremum norm in Ff, is indeed a norm and 
show that F,, is a Banach space with respect to this norm. 


Solution. That || - ||,, is a norm can be readily shown as in Problem 
7.2. We show that F,.(Q) is Banach. Let {f,,} be a Cauchy sequence 
in F,(Q). Then, given « = 1, there is N = N(1) such that for all 
m,n > N(), |lfn—- fall. < 1. Let m = N. Then, 


Ilfnlloo S Ifa — Favlloo + llFrlleo 
<1+My:=M,n2>N, 


that is, {f,} is uniformly bounded. The latter is also due to Example 
5.6(2). Because the Euclidean norm is complete, by Example 5.6(72) 
and Problem 5.7, (F,({), || - ||...) is Banach. O 


7.5 Show that || - ||,, in Example 7.7(#2) is a norm. 


Solution. By Minkowski's inequality for infinite sums (Problem 
1.7b), 


oo 1/p lee) 1/p oe) 1/p 
Pineau < [Slat] + [Siw] 


i=1 


which is the triangle inequality. O 


7.7 Show that the pointwise limit x of the sequence {a} in 
Example 7.7(i7) is also an /?-limit. 


Solution. We show that x") — « in the I? norm. Because {2;} is 
Cauchy, for each e > 0, there is an N(e)-tail of {a;,}. Hence for all 
mt 2 IN, 
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* ge 


Dea -a 


and consequently, 


. fee )_ gl" < oP p= 1,2)... 
Letting m — oo we have from the last equation 


; p 
Deealee” =| <e?, r=1,2,...,n>N 


and then with r — oo we still have 


Thus, x”) — x in the /?-norm. i) 
8.2 Prove that the function || - ||, in Definition 8.10 is a norm. 


Solution. (i)z =0 @alla;=0 => |lz||,=0 oz =89. 


(4) |lyell. = Calva’ = billell.- 

(i) la + yl =a (wt+ BBM, 
= Tylor + Bl 
= Thao + 6) +B.) = Eh [lel + 1a? 
+ aRe(aidi)] < Tt, flea! + 1A + 2lallel 
= Th (lail| + Bil)” 
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(by Problem 1.6, Hélder's inequality for finite sums: for conjugate 


exponents p= q = 5 and a1,...,@, > 0, and b;,...,b, > 0 
Z . 1/2r » 41/2 

diaibi < ba ba ) 

i=1 


On, la + | 


+a elas? YORI? 
= (lle, + ly”. o 


Problems 8.4-8.6 offer an alternative proof to Theorem 8.9 by 
showing that any norm || - || on _X is equivalent to the standard norm 


Il - Il. 

8.4 Let (X,||- ||) be a finite-dimensional NVS and || - ||, be the 
standard norm defined in (8.13). Show that there is a positive 
constant / such that for each x € X, 


lal] < M|ll,.- 


Solution. Let {b;, ...,b,} be a Hamel basis for X. Define 


M = De [leill” > 0. 


Then if || - || isa norm on X, 


[lel] = d2earaubill S Di leulllball 


and by Hélder's inequality (Problem 1.6) 


< My Dele’ = Mllall,. O 
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8.6 Let (X,|| - ||) be a finite-dimensional NLS. Show that there is 
a positive K constant such that 


Kll- lle <ll- Ib 


Solution. Let (mj,...mn) € $(0,1) be a minimum point of f 
proven to exist in Problem 8.5. Let 


K:= [do midi|| = f(mi, cig Tg) 
Then, 7 = fics): 
Furthermore, if = >;_,a;b; € S(0,1), then |||, = 1 and 


I|=|| =F iy acting tig} eae 


Therefore, for each y € X\{0} (because | wh |. = i) 


jal « 
yl. || = 


or 
llyll > ly ll.A; 


which also holds for y = @, thus on entire X. 


Because || - || and || - ||, are equivalent, any two norms on X are 
equivalent by transitivity (Proposition 8.5). O 


8.8 Let Y be a finite-dimensional subspace of an NVS (X, || - ||). 
Suppose there is a bounded sequence {y,,} C Y. Show that 
this sequence has a subsequence convergent in Y. (This gene- 
ralizes a known result from calculus that any bounded se- 
quence in IR” has a convergent subsequence. ) 


Solution. If {y,} is bounded, then there is a closed ball C(0, M) 
such that {y,,} C C(@,M). Because by Corollary 8.2, Y is closed in 
X and any closed ball is also closed, YC(0,M) is closed. 
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Obviously, {y,} C Y NC(@,M). Because Y has the Heine - Borel 
property and Y 1 C(0,M/) is a closed and bounded subset of Y, 
Y 1 C(0@, M) is compact. Thus, by Theorem 6.3, {y,} must have a 
convergence subsequence in Y. O 


Chapter 3. Elements of Point Set Topology 


1.2 Let X be a nonempty set and 7 = {X,0,C° :C C X and C 
is finite}. Show that r is a topology on X. 7 is called the 
cofinite (or finite complement) topology on X. 


Solution. We exclude the trivial case when X is a finite set, because 
T becomes a discrete topology. Then, in the general case, 7 is filled 
with infinite sets whose complements are finite sets. Obviously, T 
contains all unions of such sets. This is an easy-to-verify claim with 
De Morgan's law. Indeed, if {O; : 1 € I} C7, 


(uo) =o: 


el ie. 


must be a finite set as the intersection of finite sets. Furthermore, 7 
must be closed relative to finite intersections, because again by De 
Morgan, 


(o.) = Uo: 
i=1 i=1 


must be finite as a union of finite sets. Then, its complement is an in- 
finite set. O 


1.4 Prove Proposition 1.6. A C X is a neighborhood for all of its 
points if and only if A is open. 


Solution. (i) If A is open then the statement follows immediately. 
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(ii) Suppose A is a neighborhood for all of its points. Then, for each 
x € A, there is an open set O, C A. Thus, J O, is an open set and 


LEA 
it is also a subset of A. On the other hand, A = L) {x} and it is a 
ZEA 
subset of (J O,, because {x} C O, for each x. O 


rEA 


1.6 Prove Proposition 1.9. The closure of a set A is the smallest 
closed set containing A. Furthermore, A is closed if and only 


ifA=A 


Solution. (i) Show that A is closed or that A° is open. Denote 
ExtA = Int(A°). Then, « € ExtA = JU, C A®. But then this U, 
does not meet A and thus U, C A’. This happens if and only if 
a € A®. Because ExtA is by definition an open set, then so is A° and 
thus A is closed. 


(ii) We show that A is closed if and only if A = A. If A is closed, 
then obviously A is the smallest closed superset of itself and thus 
equals A. Conversely, if A = A, then A is closed. 


(iii) We show that A is the smallest superset of A. Suppose there is 
a closed superset F' of A strictly smaller than A. Assume that A is 


not closed [or else by (i) A is the smallest superset of A], then 


AC F. Then, Int(A°) = A° Cc F°. Because Int(A°) is the largest 
open subset of A° and because A C F = F® C A®° we have 


Int( A?) = A CFC Ae, 


which shows that F’° is strictly larger than /nt(A‘°), which is im- 
possible. O 


1.8 Show that A= AUQOA. 


Solution. 


DAC AS XaAULC AUL =X, 
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2)From1) => AC AU AC=X => AN[AU AJ=A. 
3) A= AN X (from 2)) = AN (AU A?) 
(by the distributive law) 


=AU (AN A*2)=AU OA (by definition). O 


1.10 Show that A = 44 0A. (Hint: Proceed in the same way as in 
Problem 2.11, Chapter 2, and work with a neighborhood in- 
stead of a ball.) 


Solution. 


a) We prove that An AC = @. Let x € A. Then, there is an open 
neighborhood O, C A > O,N AS =O => z¢ A. 


b) We show that A + A* =X (the carrier), which would be 
equivalent to A = (Ae)*. Obviously, x ¢ A® & JO,N AS =@ 
&O,CAS re A. 
c) Prove the statement by the following steps. 
A=AnX=An(A+ A) 
—~A+(AnNA) = A+ OA. Oo 
1.12 Show that A = AU A’, 


Solution. Because A’ and A are subsets of A = A’U A C A. On the 
other hand, for each x € A, each neighborhood U, meets A, then 
either x € A,orelsex ¢ Abutr € A!) > AC AUA’. O 


1.14 Show that 0A = @ if and only if A is open and closed. 


Solution. 
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1) A is open and closed implies that A = A and A‘ = A°. Because 
OA = AN A‘, we have OA = AN AS =@. 


2) Let OA = AN A® = @. From 


@=ANAC=ANAP CANA =Q@, 
it follows that AN AC = AN AS. 


Now, AN A° C AN Aé =@. Thus, A\A= AN Ao =@Q, that is, 
A= A, for A C A. Analogously, we show that A° = A°. oO 


1.16 Show that the inverse inclusion in the previous problem holds 
if and only if A is closed and open. 


Solution. Clearly, 


Ae C (Ay? & Ae\(A)Y’ = @. 


Now, 


Ac\(A)’ = ACN A=OA=GO 
(by Problem 1.14) <= A is open and closed. O 


1.18 A subset of a topological space (X,7) is a Gs set if it is a 
countable intersection of open sets. A subset F’, is defined as a 
countable union of closed sets. Show that G's can be represent- 
ed as a countable intersection of a monotone nonincreasing se- 
quence of open sets. Likewise, show that F’, can be represented 
as a countable union of a monotone nondecreasing sequence of 
closed sets. 


Solution. If Fe =U , then, with F, = U Fi, we have 
k=1 k=1 
Fy =|) Fe O 


n=1 
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1.20 Let U/, be a neighborhood system at x. Show that for each 
U €U,, there isaV € UU, such that U € U, forally € V. 


Solution. Given U € U,, let V = U, so for each y € V = U, U isa 
neighborhood of y and thus U € Uy. L 


1.21 Let B, be a neighborhood base at x. Show that if A, B € B,, 
then there is a base neighborhood C' € 6, such _ that 
CCANB. 


Solution. Because B, C U, and U, is M-stable, by Problem 1.19, it 
follows that AN B € U,. By the definition of B,, then AN B is a 
superset to at least one base neighborhood of x that we can denote by 
C. O 


1.25 Let S C P(X) be a collection of subsets of a nonempty set X. 
Show that there is a topology containing S. Also show that 
there is the weakest topology containing S, in notation 7(S). 


Solution. Clearly, there exists a topology containing S [for instance, 
P(X)]. Then define r(S) as the intersection of all topologies 
containing S. We show that 7(S) is a topology on X. 


(2) X and @ belong to all topologies containing S. Therefore, X and 
© belong to 7(S). 


(ii) Let O,,Oo,...,On € T(S). Then O1, Oo,..., On are elements 

of every topology containing S. This implies that (} O; belongs to 
k=1 

all topologies containing S, and thus it belongs to T(S). 


(277) By similar arguments, 7(S) is closed relative to the formation 
of arbitrary unions. 


Obviously, 7(S) is the weakest topology containing S. O 
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2.2 Show that BC P(X) is a base for a topology on X if and 
only if two conditions are met: 


(i) Each 2 € X belongs to at least one set Be B (or 
equivalently, X = |) B) 
BeB 
(iz) VB,,B,y€B and Ve € Bi N Bo, ABEB such that 
tEBCB, NB». 


[Hint: Use the steps that follow. 
1) If B is a base, then apply Theorem 2.2 (iz). 


2) letr=< 1) Beye Cc By. Show that 7 is a topology 
BeB' 


on X and that B is a base for T.] 


Solution. 


1) Let B be a base for (X,7). Then statement (i) follows from 
the fact that each open set, in particular, X, can be represented as the 


union of base sets. Statement (72) is due to the following. Because 
B,, By € B, By N By = Bi, with Bi € B. Thus, for an x € By N 


By, there isa B € {Bi} such that BC By NM By. 


2) Let (i) and (iz) hold and let 


r={ U Bive'cBh. 
BeB' 

Then, 7 is a topology on X and B is a base for r. Indeed, from (7) it 
follows that X € 7, also 7 is closed relative to the formation of all 
unions. It remains to show that the intersection of any two elements 
of 7 belongs to 7. If O,; ,O2 € T, then O, = UB*, k= 1,2, for 
some {B*} C B (the index k is not the one over which the union is 
formed). Let 


gEO,N0,=(U BY) Nn (U B?). 
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Therefore, there is at least one set B’ € {B'} and one set B” € 
{ B?} such that x € B’ NM B”. By assumption (ii), 


1n,-C BOB eb, 6h iB" and 5b, eB. 


Hence for each x € O,; Ono, there is a B, € B such that B, C 
OM Oz, which implies that the latter is an open set, and 


OpfiOe=. |) By 


tEO NO» 


Thus, 7 is a topology on X. The fact that B is a base for 7 follows 
directly from the construction of 7. O 


2.4 Let B, and By be bases for the topologies (X,7,) and (X, 72), 
respectively. Prove the equivalence of the following statments: 


(i) For each x € X and each base set A, € 6, that contains 
zx, there is a base set B, € By which contains x and 
Bae Ax, 


(it) 72 is finer than 7}. 


Solution. (i) => (ii). Let By € By and let x € B, be any element of 

B,. According to assumption (i), there is a base set Bo, € By that 

also contains x and is a subset of B,. Thus, any B, can be “filled” 

with base elements from 62, that is B,; =) Bo, which makes B, a 
reEB, 

T. element. In other words, B, C 7. By Theorem 2.4 (7), then 

TL Cc T2. 


(ii) = (i). Let x € X. Then, there is a base element B, € B, that 
contains x. Because 7; C T2 = By, € T2. Now being an open set in 
T2, B, can be represented by a union of base sets from bg, at least 
one of which contains x. Thus, there is a By C B, that contains x. 


2.6 Show that the collection of all sets on the real line of types 
(a, oo) and (—oo, b) is a subbase for the usual topology (R, 7.). 
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Solution. Obviously the union of all intervals of type (a, oo) and 
(—oo,b) gives R. Thus, their collection S is a subbase for 7(S). 
Now, if we extend S by including in S all finite intersections, we 
arrive at the base B for 7(S), in accordance with Theorem 2.8. On 
the other hand, G6 will include intervals (a, b) which are known to be 
base sets for 7, (the usual topology). Therefore, 7, and 7(S) share the 
same base sets and hence must coincide. The latter yields that S is a 
subbase for 7.. O 


2.8 Show that the usual topology 7. on R” and the product 


topology Tp generated by x 7) coincide. 


Solution. By Example 2.3(ii7), R (the set of all base rectangles in 
R”) is a base for (R",7.). Recall that a rectangle of type R 


= [[(a;,b;) is a base or simple rectangle. In other words, 7(R) 
i=1 
=7 


On the other hand, 
RG Te he CO Te S Tie Kt Re 


Thus, by Theorem 2.4 (i), T(R)=7 Crp. In addition, the 
generator of Tp, 


Te X +++ X Te C Te. 


Indeed, O; is a union of open intervals and equals |)(a;, b;). Thus, by 
Problem 5.2(2i), Chapter 1, 


O7; x Oo = U(O; x In) 


= U(|Ua] % h) = Uh x £2). 


By induction, O; x --- x O, 1s a union of open base rectangles and 
thus is open in 7. 
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Having shown this, if we use 
T1 = Te, 72 = TP; 
B, = FR, and Bo = Te X +++ X Te, 
we have by Theorem 2.4(i77) that 


(t= = Tp. CO 


3.1 Prove Proposition 3.5. Let (X,7) be a Hausdorff topological 
space. lit, =a; and lef li, 4%, = y. Then a = yf. 


Solution. x, — x < Wneighborhood U,, 4k such that 
4 eC ass b CS Uys 

In — Y & V neighborhood U,, 4m such that 
Lag mtiee yp GC Uys 


If «Ay then AJ U, and U, such that U,NU,=@. Let 
s = max{k, m}. Then, 


1g Ee UG and i Da 
which is impossible because U; 1 Uy = ©. O 


3.3. Show that any metric space is first countable. 


Solution. By Example 2.3(ii), B, = {B(z,q): ¢€ Q, gq > 0} is a 
neighborhood base at x € X and it is countable. O 


3.5 Is it true that any first countable topological space is also 
second countable? 


Solution. No, it is not. For example, (R,P(R)) is the discrete 
topological space. This is first countable (specifically, because it is 
metrizable). However, it is not second countable. For each x, {x} 
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must be a base set, and obviously, B = {{x}: « € R} would be the 
“smallest” base for P(R) and it is not countable. O 


3.7 Show that subspaces of second countable spaces are second 
countable. 


Solution. Let (X,7) be second countable, let (Y,7’) be a subspace of 
(X,7) and let B be a countable base for 7. We will show that 
B’:=BOY:={BnyY: Be B} is a base for (Y,7’) and that the 
latter is second countable. If O’ € 7’, then O' = ONY for some 
O © T. Now, because 


O = U B;, 
some B;EB 
we have that 
= U (BY). O 
some B,NY €B’ 


4.3 Prove Proposition 4.4. Let (X,7T), (Y,m™), and (Z,7) be 
topological spaces and let f: X — Y and g:Y — Z be 
continuous functions. Then the function go f:X — Z is 
continuous. 


Solution. Let H € tT. Then by Theorem 4.2, g*(H) € 7, that is the 
inverse image of H is open in Y. Again, applying Theorem 4.2, we 
have f*og*(H)€7, now open in X. Finally, recalling that 
(go f)-!| = f+ og we complete the proof of the proposition. O 


4.5 Under the conditions of Problem 4.4, set f(z) =x+1. Is f 
continuous? 


Solution. It is easily seen that f*(R) =R, f*(O) =@, f*(y) = 
y-l = f°2}) = {1} €7, f*(2,4)) =[L3) € 
tinuous. O 


ss) 
4 
SYS 
a 
fe) 
° 
ce 


4.7 Under the condition of Problem 4.6, assume that r’ = r(d), 
that is, (Y,7’) is a metrizable topological space. 
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a) Show that f is continuous at x € X if and only if the 
inverse image under f of any open ball Ba(f(x),¢) is a 
neighborhood of x. 


b) Show that, for each open ball Ba(f(a),¢) there is a 
neighborhood U,,(€) such that 


f(Ux(e)) © Ba(f(z), €). 


Solution. a) The statement of this problem follows from Problem 4.6 
as a special case where any ball centered at f(a) belongs to a neigh- 
borhood base at f(x). 


b) By a), f*(Ba(f(«),€)) is a neighborhood of x. By Problem 2.6 
(Chapter 1), 


fx° f*(Ba(f (2), €)) c Ba(f (2), €). 


Thus, f*(Ba(f(x),e) is such an appropriate neighborhood U,(e) 
and b) follows. O 


4.8 Let f: (X,7) — (Y,||- ||) bea map, where Y is an NVS 
over a field F. Show that f is continuous at x € X if and only 
if, for every ¢ > 0, there is a neighborhood U,.(€) € U, such 
that for each y € U;(e), || f(x) — f(y)|| < e. 


Solution. This is a corollary to statement in Problem 4.7 b). Indeed, 
4.7 b) is equivalent to the statement: “V y © U,(e), it holds 
d( f(x), f(y)) < .” Because d is TIH, 


= || f(z) — fy) lla <e. oO 


4.11 Prove Proposition 4.14. Let f,g: (X,7) — (Y,7’) be two 
continuous maps that coincide on some dense set in X. If 
(X,7) is first countable and if (Y,r') is Hausdorff, then 
f=gon x. 
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Solution. By Lemma 4.13, the set S = {a: f = g} is closed, because 
SisdenseinX > S=S=X. oO 


4.13 Show that (IR,7.) is homeomorphic to (—1,1) with the cor- 
responding relative topology on (—1, 1). 


Solution. The function f(x) = tan(22) is arelevant homeomorphism 
between (R,7,.) and (—1,1). O 


5.1 Let (Y1,71) and (Y2, 72) be topological spaces, let (Q, 7p) be 
the product space, and let F; € 7,2 = 1, 2. Show that fF, x Fy 
S Ts 


Solution. By Problem 5.1(ii) of Chapter 1, (F\ x Fy) = (Ff x Y2) 
U (Y x FY) implying that the latter is the union of two open base 
sets in Tp. Thus, F x F» is closed in Tp. O 


5.3. Show that Int(A x B) = Ax B. 


Solution. (i) Clearly, Ax Bis open in Tp and because A C A and 
BCB, it follows that Ax BC Ax B.But Int(A x B) is the 
largest open subset of A x B, whereas Ax Bis just an open subset 
of A x B. Thus, ARBC Int(A x B). 


(ii) The inverse inclusion. Because /nt(A x B) is open in Tp, for 
any point € = (x,y) € Int(A x B), there is an open neighborhood 
O¢ which is in Int(A x B). As an open set in Tp, 


Oc =U O71 X Oo. 


O1ET,02€T2 


Therefore, € belongs to one of these open sets, say O; x O2. Hence, 
x € O; and y € Os. Obviously, O; C A and O2 C B. Thus, x and y 


fe) ie) 
are interior points of A and B, that is, € € A x B. O 
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5.6 Let (Yi,7;),i = 1,...,n, be topological spaces and (0, 7p) be 
the product space. If A; C Y;,2 = 1,...,n, show that 


[[ 4: = [[ Ai. 
i=l i=1 


Solution. (i) We first show that 


One can show part (i) in exactly the same way as in part (i) of 
Problem 5.21. Alternatively, one can show it through the following 
steps. 


Step 1. If f is a continuous function, then f*(B) C f*(B). 
Indeed, f*(B) C f*(B) and f*(B) is a closed superset of f*(B), 
whereas f*(B) is the smallest closed superset of f*(B). 


Step 2. Recall from Problem 1.7 that AN BC AN B, which can 
be extended by induction for any finitely many sets. 


Step 3. From Step 2 and then Step 1, Because the projection maps 
7;'s are continuous, it follows that 


(\nr(Ai) © (\ae(AS (Step 1) € (3A), 


i=1 i=1 i=l 


thereby concluding part (7). 


(it) Show the inverse inclusion. Let 


J 


a 


f= (Yigex+4 Hn) = Ai. 


Thus, y; € A;. Let Og be any open neighborhood of € in (2. There- 
fore, it can be formed by a union of open base rectangles of type 


n 
[| O;. Because € belongs to the union, it must belong to one of these 
i=1 
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sets, say to I O,,. Consequently, O,, is an open neighborhood of y; 
in 7; and ae y; € Aj, it follows that O,, must meet A;. Thus, 
I O,, must meet I A; implying that € is a closure point of Il Aj. 
The latter acne ak (iz). = O 


5.8 Prove Proposition 5.1. Let 
B=B,x---xB,: = {B,x.---x B,: B; € B;} 


where B; is a base for T;, 1 =1,...,n. Then B is also a base 
for Tp. 


Solution. Let O€Tp. Because B= {Oj x--- xX On: O; © Ti, 
i =1,...,n} isa base for rp, 


O= (J oul, 


jel 
where Ol] = ol x» x OW € Bando! c T;, 8=1,...,n. 
If Bb, is a base for 7,, then we have 


of = U BP with BM € B,, s=1,...,n. 


isEl, 


Now, taking into account that 


oll x. x olf = (\n; (ol) 


s=1 


by the distributive law, we have that 


jeJ jet |s=1 is€l, 


o=UoU=U Are ( U nt) 
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=U U... U {mi(Bi!) n... nm (Be)} 


jet yeh in€ln 


=U U... U (Bl x. x Bi). 


jet Hel in€In 


The latter shows that every open set from Tp can be represented as 
the union of base parallelepipeds. O 


5.10 Prove Proposition 5.3. Let 
S = {ai (Sj): vipsee eo Lisseg tts 


where S; is a subbase for 7;. Then S' is a subbase for 
Tp S718). 


Solution. In equation (P5.2) we replace | | Og with () (s®). 
k=1 k=1 


Then B is constructed as in Problem 5.1 (from those G;'s) is a base 
for Tp. LC 


5.12 Let f: (2,7) > (a =|] ¥i,7») and 17;:(2— Y;. Show 
i=l 
that the function f is continuous if and only if each 7; 0 f is 
continuous. [Hint: Show that f*(S') € 7, for every subbase 
element of rp, and then apply Theorem 4.8.] 


Solution. 


1) If f is continuous then 7; 0 f is also continuous, for the 
composition preserves continuity. 


2) Let each 77; 0 f be continuous and let S = S; x --- x S;, bea 
subbase set in Tp. Then 


rs) =F (A ai6)) =A (mo AS) 
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Because 7,0 f is continuous, it follows that (a; 0 f)*(S;) €7, 
k=1,...,.n => f*(S)€7. By Theorem 4.8 it is equivalent to 
continuity of f, for we showed that f*(S) € 7 for an arbitrary 
subbase set. C 


5.14 Let (Y,,7,),2 © X, be an indexed family of topological 
spaces and (2, Tp) be the Tychonov product space. Let A be 


an arbitrary subset of X and fF, € 7°,x2 € A. Show that [| F, 
zEA 
is closed in Tp. 


Solution. 


(T14:) = (Qa) =UmteD, 
LEA LEA “EA 

which is a union of open sets. In fact, each 7*(F°) is a base open set 
in TP. im 


5.16 Let (Y:,T,),x © X, be an indexed family of topological 
spaces such that S,. is a subbase for 7,, x2 € X. Show that the 
collection of all unit subbase cylinders 


S=U mi (Sr) = {77 (Sr): Sr € Sr, FE X} 
cExX 


[in Remark 5.8(2) and Remark 5.10] is a subbase for the 
Tychonov topology. 


Solution. 1) Suppose S,, is a subbase for the factor space (Y;, 7,) and 
B,, is the associated base for 7, in Y,. According to Problem 5.15, 
simple cylinders such as 77, (B..,) Peete tia. (B,, ) form a base B 
for Tychonov topology rp. To simplify the notation let B € B be in 
the form 


B=n(By)N--N1*(B,). (P5.16) 
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Now, because B, is a base set in 7,, it can be represented through 
subbase elements as 


By =) ae y= 1,..247, 


dyELy 


where I, is a finite subset of N, i, means a variable index associated 
with y, and oe € S,. Thus, from (P5.16) we have 


B=mi( SY |n--aml nsf 
HE], in€In i 


=) £5 (5) n---raa(sir))., (P5.16a) 


HE], in€lIn 


So, B ends up being a finite intersection of simple cylinders of the 
form 


r (sf) AeA (sf) (5.16b) 


on 


that are made of subbase sets. Thus, B = [| R., where 
vExX 


Rox S,, tEACX 
") Y¥;, we AS 


with A as a finite subset of X and S.. as a subbase element of S,, C 
T,. According to Theorem 2.8, finite intersections of subbase 
elements (simple cylinders) of type (P5.16b), form a base 6 for the 
weakest topology 7(S). Because B = B(S) and because rp and 7(S) 
share the same base, by Theorem 2.4(77), they coincide. 


Notice that simple cylinders are made of unit cylinders. The latter 
can be utilized as more rudimentary subbase elements. Finite 
intersections of them as in (P5.16b) lead us to base B through 


604 CHAPTER 7. SOLUTIONS TO SELECTED PROBLEMS 


(P5.16a). Thus unit cylinders also serve as a subbase for Tp. In other 
words, 


S = U 1 (Se) 


cEX 


is a subbase for Tp. 


Finally, we have 


Q = 13(¥r) = 13 (U s) =Ut (°°) 


ie. ie. 


proving that S indeed is a subbase for Tp. O 


5.18 Let (0 =|] ¥. r») be the Tychonov topology and assume 
rex 


that each factor space is first countable. Is (Q,7p) first 
countable if: 


by |x| > ¢? 


Solution. Let Og € Tp be an open neighborhood of a point € € 122. 
Because 


p= {n,( 110.) x24} 


rEA 


is a base for Tp, O¢ is representable in the form (J B for B’ C B. 
BeB' 


Thus, € € some Be of this union which is a simple cylinder of the 
form 


n 


Be= te, an (i! 0, = mh (On) M148 (Ox,). 


i=1 
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Now, because each Y, is first countable, there is a countable 
neighborhood base Bez) = Eee eA 
&(a) of € such that there is some At” C O, for any neighborhood 


O,, of €(x). Repeating the same process for each x; € {21,...,2n}, 


we have that oT om (i Ag is a neighborhood of € which is a 
eo 2 i=1 eat 


\ at the x-coordinate 


subset of Be and thus of O¢. 


Now we form the neighborhood base at € as follows. Let 8 = 
{ Bex) :crEX } and let Bg be the set of all finite Cartesian products 
of sets from %. If X is countable, then according to Problem 6.14, 
Chapter 1, Be is countable. From the construction of Be and due to 
the nature of the Tychonov topology it follows that Be is a 
neighborhood base at € and it is countable if so is X. Indeed, because 


the choice of O¢ was arbitrary, we could pick out a finite set 
\Arayt =], aah) from neighborhood bases at each pertinent 
coordinate €(), with x € A. 


However, Bg is not countable if X is not countable. i 


5.20 Let ( =| [3 r») be the Tychonov topology and assume 
cEX 
that each factor space is second countable. Is (0, Tp) second 
countable if X is countable? 


Solution. Yes, 91 is second countable. According to Problem 5.15, Tp 
can be generated by the base 
B= {m( I] B.) : A's are finite subsets of X, B, € a, 


reEA 


where &,, is a countable base for 7,. Therefore, B is induced by the 
set 


A: ={ x a,:4cx,A<N} 
zEA ~ 
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in agreement with the following specification. Given an n € N, a set 
A= {1,...,%} € X is selected. According to A, n factor spaces 
Yr,,-++,Yx, are chosen and the corresponding bases ®,.,...,B,.. 
From these bases we pick out respective elements B,;,,...,Bz;,,, one 
from each followed by the formation of the cylinder 


a= cee (112. ) -_ 7, (Bay) hese fi Ty: (Be) Ee B. 
? wn fai n 


The selection process described above is identical to that in Problem 
6.14, Chapter 1, according to which the set 2( must be countable. O 


5.22 Let (a =|] ¥, r») be the Tychonov topology and assume 
rEX 
that each factor space is separable. Is (Q., Tp) separable if X is 
countable? 


Solution. Yes, it is. Let D,, be a dense set in (Y,, 7) and consider the 
set {D, :« € X}. Let 


{Dig So Dy, tne Nitin et, |} CX} 


be the set of all finite Cartesian products from {D, : x € X}. Now, 
if D,, X---x Dz, is picked out, then we select a point 
{dy,,.--,dz,} € Dz, x +++ x D,, and include it in set D. According 
to Problem 6.14, Chapter 1, the set of all such points is countable. 


Finally, we define the set D as follows. Let d € [] D, be a fixed 


LEX 
arbitrary point from this product. Given a set A = {2 ,...,%,} C X 
and the associated point {d,,,...,d:,} € 9, we complement 
{d,,,.-.,dz,,} to a point in 2 by the elements d, from d, with 


x € A® and denote such a point by d4. Consequently, 
D={d42 A= {ai5.225a%,} CXA,0EN} 2D 


and thus D is countable. 
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Now we show that D = 2. To prove this statement we need to 
verify that for each € € (2, any neighborhood ©; meets D. As usual, 
there is a base neighborhood O being a subset of O¢ in the form 


OT gt (iL Oras), where Og(a,) € Ta; 


and being a neighborhood of €. We recall that €(x) is the ath 
coordinate of € in the factor space Y,,. Now, because D,, is dense in 
Y,, Og) meets D, for each x € X, in particular, for each x € A = 

{£1,...,%}. Therefore, each Og(,,) contains at least one point of 


D,,, say dy, Therefore, []Og(,,) meets the set {dz,,..., dz, } which 
1 


by our construction must be included in 9 and therefore, O includes 
the point d, and thus meets D. Consequently, O¢ meets D. O 


6.1 Is the property of a set to be dense vaguely hereditary? (Con- 
sider Example 6.4.) 


Solution. Although we know from Example 6.4 that the property of a 
set to be dense is not hereditary, we can show that it is vaguely he- 
reditary. Let Y be open in (X,7) and let D be a dense subset of X. 
Recall that then [cf. Definitions 1.8(7-22)] for each x € X and any 
neighborhood U,, of x, U;, D # @.) We claim that DM Y is dense 
in Y. If this is not the case, then there is a y € Y and an open 
neighborhood U, of y in (X,7) such that Uy NY 1 D=@ (as we 
have seen it in Example 6.4) and with U, 1 Y being a neighborhood 
of y in the subspace (Y, Ty). 


On the other hand, because U,, M Y remains an open neighborhood of 
y in the original space and this is due to our assumption that Y is 
open in (X,7), it contradicts the very property of D being dense in 
(X,7). Now it is obvious what exactly we missed in Example 6.4 
regarding U, 1 Y: the latter (in contrast to our present case) failed to 
be a neighborhood of y in the original space (X, 7). O 


6.3 Show that the topological space in Problem 6.2 is not second 
countable. 
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Solution. According to Problem 6.2, (X,P(X)) is not separable. By 
Problem 3.6, a nonseparable space cannot be second countable. O 


6.5 Prove that the Moore plane is not second countable. 


Solution. To see why the Moore plane is not second countable, we 
observe that according to Problem 6.3, any discrete topological space 
with an uncountable carrier is not second countable. Because second 
countability is hereditary, the Moore plane is thus not second count- 
able. O 


6.7 Let 7 be the cofinite topology on an arbitrary nonempty set X. 
Show that (X, 7) is compact. 


Solution. Recall that (X,7) is cofinite if r= {X,0,C°: C ~ N}. 
To avoid triviality, we assume that X is an infinite set. Let C be an 
open cover of X and let some C’° € C. Then, because C’° also 
belongs to 7, by definition, C is finite, say, {x1,..., 7}. Now, there 
is yet another set Cf € C, which contains at least one point, say 7}. 
(if no such set of C is available, then C is not a cover of X.) Then, 
CNC, C {a2,...,2n}. Continuing with this process, it is easy to 
see that there is a C¢ € C such that CN C19... AC, = @ and thus 
{C°, Cf,...,C} is a finite subcover of X, which proves that X is 
compact. O 


6.9 Let {Mi(k),...,M;,(k),S,} be a cover of a set X for each 
k= 1.0.4 Show that4{244 (kh), i, Mae k= 1, 1.5%, and 


() S;,} is also a cover of X. 
k=l 


Jk 
Solution. By the assumption, X C ( U 7) US}. Thus, 
i=l 


( v 7) U s, 
f=1 


xen 
k=1 
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= (A U us) U (A si) 


C (U Ue U (A st). i 


k=1 1=1 


6.11 Let O be an open set in R”. Show that there is a monotone 
increasing sequence {O,} 7 of open bounded subsets of O 
such that {O;}7 O. 


Solution. Obviously, V x € O, there is B(x,é,) C O, and hence 

O=U B(a,<,). Because R" is Lindeléf and so is (O,t 1 O) 
xzEO 

(vaguely hereditary), the above cover, {B(x,¢,): « € O}, of O can 

be reduced to at most countable subcover, i.e., 


O=|) BW é,,) 
k=1 


j 
Denote O; = U B(a, €x,). Clearly, O; C O is an open bounded set 
k=1 
and {O;}T O. O 


6.13 Let (X,7) be a Hausdorff space and C' and K be disjoint com- 
pact sets. Show that there are disjoint open supersets, U and V, 
of C' and K, respectively. 


Solution. If one of the compact sets C' or K is empty, then the proof 
is trivial. Let C = {x} be a singleton. Because X is Hausdorff, for 
each y € K, there is a pair U,, and V,, of disjoint open neighborhoods 
of x and y. Now, for all y's of K, we have thereby two systems 
{U;,, Vy,} of open sets, of which {V,,} is an open cover of AK and 
that due to compactness of K can be reduced to a finite subcover, 


nm n 
say V,,,---, Vy,- Hence the sets VU = ()U,, and V =) V,, are those 
k=1 k=1 


with the desired property. Suppose now that C’ has more than one 
point. Then, for each x € C’, as we showed above, there is an open 
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neighborhood O,, of x and an open superset W,, of K, which are 
disjoint. For all x € C, we thereby form two systems, {O,,,, Wy,, }, 
of which {O,,,,} is an open cover of C’. This cover can be reduced to 
a finite subcover, O;,,..., Ox,. Hence the sets 


U =| \O.. and VH lw, 


m=1 m=1 


are of the required property. O 


6.15 Let K be the family of all compact subsets of a topological 
space (X,7). Under the formation of which set operations is KC 
closed? Give counterexamples of those set operations, which 
are not contained in K. 


Solution. Obviously K is not closed under the formation of differen- 
ces and complements. Let A, B € K and let {O;} be an open cover 
of A U B. Then, {O;} is a cover both either A and B. Because A and 
B are compact, the cover {O;} is reducible to two finite subcovers 
whose union is a finite cover of AU B. Thus, AU B is compact. 
The same applies to any finitely many compact sets. An infinite 
union of compact sets need not be compact. For example, let K,, = 


[=,1—<] in (R,d,). The union LU K,, = (0,1) is not compact. 


n? 


n=1 


A compact set in a topological space need not be closed unless it is 

Hausdorff. If {A;;2 € I} C K is an arbitrary indexed family in a 

Hausdorff space, then, the intersection {]) K; is closed and is a 
iel 

subset of any one of K;. Thus, by Theorem 6.9, it is compact. 

However, it does not hold true in general topological spaces. 


In conclusion, K is closed with respect to the formation of finite 
unions and, if 7 is Hausdorff, under arbitrary intersections. Further- 
more, the empty set is also compact. O 
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Chapter 4. Measurable Spaces and Measurable 
Functions 


1.2 Let R bea ring in (2. Show that with any two sets A, B € R, 
their intersection also belongs to R. 


Solution. The statement follows from 


ANB =(AUB)\(AAB), 


where 


AAB = (A\B)U (B\A). Oo 


1.4 Let A be an algebra in 2. Prove that A is a o-algebra in ( if 
and only if A is also a monotone system. 


Solution. Obviously, any o-algebra is a monotone system. Let A be 
an algebra and monotone system. Let {A,} CA. We show that 


lore) N 

UA, € A. Let By =|J Ay. Then, By € A and because A is a 
n=1 n=1 

monotone system and { By } is monotone nondecreasing, 


CO 


Bne A. O 
N=1 


1.6 Give an infinite collection of subsets of R that contains @ and 
R and which is closed under countable unions and inter- 
sections but is not a o-algebra. 


Solution. Let S={R,@,A,:n€N}, where A, = (—o0,n). 
Clearly, S meets the assumptions of the problem. However, S does 
not contain complements. For example, AS = [n, oo) ¢ S. Thus, S 
is not even an algebra, but it is a monotone system. 


1.8. Give a Dynkin system that is not a o-algebra. 
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Solution. The system in Problem 1.7 is one, because the intersection 
of two overlapping sets does not have to contain an even number of 
elements. O 


1.10 Prove the statement: A Dynkin system D is a o-algebra if and 
only if D is 1 -stable. 


Solution. (i) Assume that D is a M -stable Dynkin system. Let A, B 
€ D. Because D is (-stable = ANB € D. Furthermore, because 
AN BC Awecan apply the result of Problem 1.9, namely, 


A\B = A\(ANB) €D. 


Therefore, an  -stable Dynkin system is closed relative to differen- 
ces. Next, because AU B = A+ (B\A), it follows that a M -stable 
Dynkin system is closed relative to finite unions. 


Now, let {A,} C D. Denote B, =U Ax, n =1,2,..., and Bo = 
k=1 

®. Obviously, {B,} CD and sequence {B,,} is monotone non- 

decreasing. Then, 


U An = U By = ear 6 (P1.10) 
n=0 


n=1 


where C,, = B,\Bn_-1,n = 1,2,... are disjoint. Moreover, C,, € D, 
again by Problem 1.9. Thus, from (P1.10) we conclude that [J A, 


n=1 


e D, 


(it) Let D be a o-algebra. Then, by Problem 1.5, it is also a Dynkin 
system. As a o-algebra, D is ™-stable (based on Problems 1.2 and 
1.5 or Example 1.2(v). O 


1.12 Show the equivalence of two definitions of a semi-ring if 
property c) in Definition 1.1(v) can be replaced with 
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c’) Let A, Aj,...,A,€S. Then there is a finite tuple 
Ci,..., Cy of disjoint sets from S such that 


A\ UA = C) 
Solution. Let S be a semi-ring in the sense of Definition 1.1(v). 
Then, 
A\(A; U Ag) = (A\A1)\A2 
= {yoja1Cj}\ Ae rs jai Ci\A2 
-_ ye ae 
The statement follows by induction. The converse is trivial. O 


2.1. Prove Theorem 2.1. The intersection of arbitrarily many o- 
algebras (algebras, monotone systems, rings) in Q is a o- 
algebra (an algebra, a monotone system, a ring). 


Solution. Let ©’ = (| &;, where {57; : 7 € I} is an indexed family of 
tel 


o-algebras in 22. Then, 
(a) NED, 


(b) If A € X’, then A belongs to all 7's and thus A° € ¥,7 € I. 
Consequently, A° € »’. 


(c) Let {A,} C S’. Then, each A,, € ’; for all i € I and hence, 
L) A, € »}; for all z's implying that the union of A,,'s is also an ele- 
n=1 


ment of 3’. 


The proof regarding the intersection of algebras, monotone systems, 
and rings is similar. O 


2.3. Show that a union of o-algebras in 2) need not be a o-algebra. 
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Solution. Let 3, = {0,@, A, Ac} and X>) = {0,@, B, B°} and let 
A#B. Then, ©, UX» = {0,G, A, A°, B, Bo}. Excluding some 
trivial cases, X; U 3’y is not a o-algebra. i 


2.5 Let »’ be a o-algebra in 2 and let B C (2. Show that the o- 
algebra generated by G = ’ U {B} is of the form 


SY! = {(A,M B)U(Ag\B): At, Ap € Z}. 


[Hint: 1) Show that o(") = o(2’ U {B}). 2) Show that &” is 
a o-algebra in 2.| 


Solution. Denote o(”) the o-algebra generated by ” and 
o(S'U {B}) the o-algebra generated by ©’ U{B}. We prove the 
statements in accordance with the following chronology. 


1) of") =o0(S U{B}); 2) LX” is ao-algebra. 
1) We show that 0(’) = o( XU {B}) 


(i) Notice first that B need not be a measurable set. Thus, the 
smallest o-algebra generated by ©’ and {B} must contain all sets of 
type A, N B, Ao\B, and (Ai NB) U(A2\B) such that Ai, Ar € SY, 
to say the least. Hence, ©” C o( 5’ U {B}) implying that 

o(3") C o( ZU {B}), (P2.5) 


because o(2’) is the smallest o-algebra containing »’ and 
o(5' U {B}) is a o-algebra containing ”. 


(ii) We show the inverse inclusion of (P2.5). We list the following 
properties of 4” as per its definition. 


a) With the choice of A; = 2 and Ay = @, we have B € &”. 


b) If A; = Ag = A, then (A, B)U (A2\B) =A € L" imply- 
ing that 37 C ¥”, 


Therefore, 
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DU{B} CE" > o( SU{B}) Co(E%, 
consequently having 
o(&") = o(S' U {B}) 
and we are done with 1). 
2) We show that X” is a o-algebra. 
(a) Because 7 C &”, it follows that Q € X”. 
(b) Let C € X”. Then, 
C = (A, B)U(A2\B) = (AN B)U(A2N B’). 
By DeMorgan's and distributive laws, 
Ce = (AGN AS) U (AS BS) U (AGN B)U(BNB’). 


Now, because A§ NM A§ C (ASN B°)U(AS AB), which can be 
shown by a pick-a-point process, we have C’© reduce to 


O° = (ASN B*) U (Af NB) = (AEN B) U(AS\B) € Z 
in as much as Af, AS € ». 


(c) Let {C,} C &”. Then, C, = (AT N B) U (AR\B) and 


3 


Ucn ={ Utains) buf Gian} 


n=1 n=l n=1 


= (Wat) ne U (Us) \p ea. 


n=1 n=1 


a2 Ex 


In conclusion, 47’ is a a-algebra. O 
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2.7. Let © be an arbitrary nonempty set and let A,BCQ. 
Construct for the topology 7 = {0,0,A,B, ANB, AU B} 
the Borel o-algebra B(r). 


Solution. Supplement 7 with the following subsets. 
A\B, B\A, A°, B®, AU B®, ACU B, ACN B*, A°U BS, AAB, 


and (AAB)° = (A°UB)N(AUB*). O 


2.9 Show that any open set in R” can be represented as an at most 
countable union of disjoint half-open cubes. 


Solution. For each positive integer k, let $8; be the partition of R” 
by cubes of type [a, b), where 


_ m — m+ = 
ai = oF b; = 2k , m=), +1, FE Di. 


For k < p, clearly 8, < 8, in the sense that each cube of {B), is 
included in some cube of {B,. 


Let G be a non-empty open set in R”. Let S; be the set of all 
cubes from $8; contained in G. If S; = @, we try the same with Py 
and so on. Suppose k is the smallest positive integer for which 
S;, AO. Denote S;, by o;,. From S;,,, select only those cubes that 
are not included in o;, and denote this set by o;,,, and continue this 
process to form the sequence {a,;}. Now let 4; stand for the union of 
all cubes from oj. Clearly, 1; C G and therefore 


We claim that 


Let x € G. Because G is open, there is an open ball B(xz,r) C G. By 
satisfying the inequality 
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in 


oF <7 


with respect to t we can select exactly one cube, say C(x, t), from 
§8,, which contains point x and is included in B(x,r) and therefore 
in G. Surely, the inequality will also be satisfied for t + 1 with one 
cube available from 8;,, which is in G and so on. Each one such 
cube C(x, t) is from o; and thus is included in %:;. The latter contains 
x and therefore x € &. O 


3.1. Show that ¥’ in the proof of Proposition 3.4 is a o-algebra. 


Solution. 

(i) Obviously, 2 € 3’. 

(ii) Let A € 5. Then, f* (4’) = (f*(A))° © 3, because f*(A) 

does. Hence A° € S. 

(tdi) Next, if {A,} C 3 => f*(A,) € Y. Then, 
r(On)-Braves 

implying that 5) is closed relative to countable unions. O 

3.3 Let f :Q — 1 bea function and let G’ C P(N’). Show that 

PEG) = SFG"). 


[Hint: Let 3: = {A' € P(N’): fr(A) EZ 
that 3’ is a o-algebra. Then show that 4'(G’) 


(f°*(G"))}. Show 
Cx] 

Solution. (1) Because G’ C 37(G’), we have f**(G’) C f*(2(G’)). 
Obviously, f**(7(G’)) is a o-algebra in containing f**(G’). 
Because »’( f**(G’)) is the smallest o-algebra generated by f**(Q’), 
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SF" (G')) C f"(2(G)). 
(ii) We show the inverse inclusion. Define 
B= {A’ € P(M’) : fr(’) € LF" (GY). (P3.3) 
We show that 3? is a o-algebra in 11’: 
a) fF(NX) =NE T(FfeG)) sa eL. 
b) Let A’e 3’. Then, f*(A’) € X(f**(G’)) and f*(A”) = 
[fr(AD]o e VS AU ED. 


c) If {AL} OU (F(Al)} C BUEMG)) = U FAI) € 


n=1 


nce) = (U4) € 20") 304, € E. 


n=1 


(iii) We show that G’ C ¥’. Obviously, VG’ € G’, f*(G’) € f'*(G’) 
bearing in mind that 


f(g’) _ {f*(G') : G E G'}. 
Consequently, 


f*(G) € U(f"(G')) good VG eG >Gle XY. 
(iv) From (iii) it follows that 5G’) C ¥? and thus, 


fr(ZG)) c Fr ((Z)) © BF"G’). 


The second inclusion is the direct consequence of the definition of 37 
in (P33). 


So, the dual containment yields the asserted equation. O 


3.4 Let [O,O,, F] be a homeomorphism, with O and O, being 
open sets in topological spaces (X,7) and (X,7,), respective- 
ly, and let B(7,) and B(7¢,) be the Borel o-algebras generated 
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by the relative topologies 7 > and 70,. Prove that [B(70), 
B(To,), F.] 1s bijective. 


Solution. We prove that F..,.(Bo) = Bo, and F**(Bo,) = Bo. If To 
and 70, are the respective relative topologies, by Problem 3.11, 
Chapter 1, 


F (76) = 70, and F**(76,) = To. (P3.4) 
By notation, 
a(7G,) = a(Or imi) = Bo. (P3.4a) 
By Problem 3.3, and (P3.4 - P3.4a), 
F"(Bo,) = F™(e(to,)) = oF") 
= 7(74) = Bo: 


Because F' is homeomorphism, applying the same procedure to F,,, 
we prove that F.,.(7o) = To,- 0 


Chapter 5. Measures 


1.2 Prove Proposition 1.5(i, ii). Every content ys on a ring R has 
the following properties. 


(i) ACB=>p(B\A) = u(B)— (A) [provided _ that 
p(A) < oo]. 
(it) p(AUB) = (A) + p(B) — w(AN B). 
Solution. 


(i) ACB>B=A+4+(B\A). Because ys is a content, thus 
additive, 
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W(B) = (A) + w(B\A). 
Because p(B\A) >0 => p(B) > w(A). Now, if (A) < co, we 
have 

w(B) — p(A) = w(B\A). 
(7) AUB=A+(B\A) =A+(B\(ANB)). Then by (2) and 
additivity of a content it follows that 

(AUB) = p(A) + p(B) — W(An B). O 
1.4 Prove that jz is a measure on ¥’ in Example 1.2(v). 


Solution. Let B, = (J An. Then, {B,,} is monotone decreasing and 
k=n 

U( By) < Yo, (An) < co. Thus, we can apply the continuity from 

above of Proposition 1.5 (i) to the sequence {B,,} with 


oe Bn =, lim An 


so that 
tim, (Bn) = WB) = (lim A,,). 


On the other hand, u(B,) < S07, “(Ag) | 0asn — co [due to 
the convergence of }>~_, 4(A,)]. Therefore, ju(B) = 0. O 


1.6 A subset R; of a ring R in 2) is called an ideal in R if it has 
the following properties. 
a) OER). 
b) BER, ACB, AER => AER. 
c) A, BER, => AUBER,. 


Let yw be a content on a ring R. Define R, ={RER: 
u(R) = 0}. Show that R,, is an ideal in R. 
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Solution. (i) Because ju(O) =0 = OE Ry. 


(ii) Let BE R,. Then (B) = 0. Let A € R such that A C B (at 
least one such subset of R clearly exists). Then, because 
u(A) < u(B), w(A) = 0 and thus A € Ry. 


(it1) Let A, B € R,. Then, 
w(AU B) = p(A)+ u(B) -— w(An B) =0+0-0=0. 


Thus, AU B € R,,. Oo 


1.8 Let R,; beao-ideal in a ring ?. Show that there exists a pre- 
measure 4 on R such that R,.7 = R,, defined in Problems 
1.7 and 1.8. 


Solution. Define 


_ 0, Ae RoI 
yA co, AER\Ro-r. 


We show that ju is a premeasure on 72. 
a) u(O) = Oand pu > 0. 
b) Let {A,,} C R be disjoint such that }>°°, An € R. 


Case 1: {A,} C Roy. Then, by Problem 1.7, S07, An € Roz 
implying that (dp An) = Dona M(An) = 9. 


Case 2: Let J C N be such that {A,,, : nz € J} C R\Re-z. Then, 


int An) = >) H(An) +37 pw(An) = 004+0= 00. 
ned neN\J 


On the other hand, Ae ¢ R,-1, because if it were the case, by 
the definition of an ideal, those A, with n € J, as subsets of 
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>~, An would have been elements of R,.; and the latter was not 
assumed. Consequently, (°°, An.) = 00 and hence 


MO n=14n) = Lnal(An): 


Observe that R,-; contains only elements of whose measure is 
zero. In short, Ro-7 = {A € R: (A) = O}. O 


1.10 Let (©, 5,44) be a measure space and {A,} CX be a 
sequence of sets with “(0 Ay) <oo and inf{p(A,): 


n=1 


n=1,2,...} =a > 0. Show that (lim An) > a. 


Solution. Let B, = (J Ag. The sequence {B,,} is monotone non- 
k=n 

increasing with ,u(B,) < oo. Therefore, by continuity from above 

[Proposition 1.5(iz)] and because obviously ju(B,,) > a for all n, 


lim, (Bn) i u( A Bs) = j( lim, An) = a. O 


n= 


1.12 Under the condition of Problem 1.11, let a = co and ys be a 
finite measure. Show that 1’, = »’. 


Solution. Obviously, G.. =X’ and therefore Y= {Q C2: 
Qn AED, VA€E X}. Because 17 C XY’, it remains to show the 
inverse inclusion. Let A, € X’4,. Then, for every A € 3’, ANA, 
A A‘, and, therefore, their union, 


Ago = (Ago a A) an (Aw a A*), 
are elements of »’. This yields that 37,, C 2’. O 


1.14 Argue that for any probability space (Q, 2’, i), the axiom pu(@) 
= 0 is redundant. Is it also true for any measure? 


Solution. From 
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(B+ O) = w(O) + u(B) = n(@) (P1.14) 


(due to the additivity of jv) it follows that ;1(@) = 0. Thus the latter 
as an axiom of a measure is redundant. However, we managed to 
prove it because ju is a finite measure. In the general case, if j is not 
finite the equation .(@) = 0 cannot be concluded from (P1.14). O 


1.16 Prove Lemma 1.11. Let fy, uo € M(Q, X’). Then the set 
function 


v(A) = inf{~i(£) + wo(A\E): Be Yn A} 
is a measure on S). 


Solution. Obviously, v is a nonnegative set function defined on ¥7 
that equals zero at the empty set. 


Next we show that v is monotone. Let # C AC B. Then, we 
have 


(EB) + e(A\E) < pi (FE) + po(B\E). 
Because this inequality holds true for all EF C A, 
y(A) < jn(E) + p2(B\E) 
and because F' C B, we have v(A) < v(B). 


We need to prove that v is o-additive. Let {A,} C X’ be a 
sequence of disjoint sets with A: = }*>~ ,A,. We first show that 


v(A) > 0 v(An). (P1.16) 


Let E be a measurable subset of A. Then, 


(FE) + p2(A\F) 
= (Ny (An ED) + oD (An EP) 


— yptt(An M E) ale [2(An\(An q E)) 
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= nw (An). 


Because )>*°,v(An) < wi(E) + f2(A\£) holds for all subsets E 
of A, then it also holds for 


inf{j4(E) + w2(A\E): Be YA} =v(A) 


and thereby (P1.16) is proved. 


It remains to show the inverse inequality. Let « > 0 be chosen. 
From the definition of v, given A,,, and any €,, > 0, there is a subset 
E,, © Ap such that 


V(An) > pi( En) + He(An\En) — En, 1 =1,2,.... 


We choose «é,'s such that }°°,¢, <e. Denote H = >°™,E,. 
Obviously, E,,'s are disjoint and E C A and 


A\E = Yin=i(An\ dopa Bk) = Linai(An\En); 
because the rest of E's are disjoint with A,. Therefore, with 
bai(E) + p2o(A\E) 
= Vinita (En) + pa H2(An\ En) 
we have: 
Yinat4(An) 2 Yon=ita(En) 
+ yinai2(An\En) — DinaiEn 
> jn (E) + p2(A\E) — € 2 v(A) ~«. 


Because this inequality holds for each ¢ > 0, we arrive at the desired 
inverse inequality and we are done with the lemma. 0 


1.18 Let (Q, 3’, P) be a probability space and let G C » be a family 
of events. G is called independent if for any finite tuple of 
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events G,...,Gm; (n Gs) = [| P(G;). Let D(G) be the 
i=1 i=1 

Dynkin system induced by G. Show that D(G) is also indepen- 

dent. 


Solution. Define the system of events (measurable sets) as follows. 
D={AC¥: {A} UG is independent}. 


We show that D is Dynkin. an Q € D. Let some A € D and 


let Gy,...,G, € G. Denote G = q G;. Then, events A and G are 
i=1 
independent. Hence, 


P(AN G) = P(ANG) = P(A)P(G) 


= P(A) J] P(Gi). 


P(A @) = P(G) — P(A)P(G) = [1 - P(A)P(G) 


P(A°)P(G). 


Thus, A°,G),...,G;, are independent and A‘ € D. Now, let {An} 
be a sequence of exclusive (disjoint) events from D. We show that 
- ki 
yo 1An € D. Let again Gj,...,G,eG and G=([) G;. Then, 

i=l 
events A,, and G are independent, that is, for each A,,, 


P(A, NG) = P(A, NG) 


= P(A.) P(G) = P(An) TI P(Gi). 


Consider 
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P(GN yen An) = yoni (G N An) 
= Vin-iP(G)P(An) 
= P(G)P(din=1An). 
Hence, D indeed is Dynkin. Now, because G C D, then DG) CG D 


and therefore, D(G) as a subsystem of an independent family, is 
independent. je 


1.20 Let (2, +’, P) be a probability space and let {Gi,...,Gn} bea 
set of n families of events. This set is said to be independent if 
for any choice of events G € Gi,...,G@n © Gn, the tuple {G,, 
...,Gy} is independent. If the set {G,...,G,} is independent, 
show that the set {D(G,),...,D(Gn)} (of Dynkin systems in- 
duced by G;'s) is also independent. 


Solution. Define the following family of events: 
D, ={ACS: {A} UG,...,Gn} is independent}. 


Here G, is enhanced, whereas the rest of G;'s are the same. We show 

that D, is Dynkin. Obviously, 2 € D,. Let some A € D, and let 

Go € Go, ...,Gn € Gn. (Note that A can also belong to G,.) Denote 
n 


G = ()G;. Then, events A and G are independent, implying that 
i=2 


P(ANG) = P(ANG) = P(A)P(G) = P(A)TTP(Gi). 


Now, ACN G = G\(ANG) yields 


P(A NG) = P(G) — P(A)P(G) 
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Thus, A°,G2,...,G, are independent and A‘ € D,. Now, let {A,} 
be a sequence of exclusive (disjoint) events from D,. We show that 
parr. = D,. Let again Gy € Go,...,G, EG, and G= (Ohare 

i=2 
Then, events A, and G are independent, that is, for each A,., 


P(A, NG) = P(A, NG) 


= P(A,)P(@) = P(A,) TT PGi). 


Consider 


P(G M yori An) _ ye P(G q A,) 


= DEPG)P(AY) 


= P(G)P(DL An) 


Hence, DP, indeed is Dynkin. Now, because G, C D,, then 
D(G,) C Dy, and therefore, D(G,) as a subsystem of an independent 
family, is independent. Next, we consider the independent set 
{D(G1), Go, ---;Gn} and do the same with G, by enhancing it to 
D(G2) and so on until we exhaust the whole set. We can use 
induction if we want to. Thus, the statement is proved. O 


2.1 Let j* be an outer measure on P(Q), wp = Resy+pu*, and v* 
be the outer measure induced by 45. Show that p* < v* on 
P(Q). 


Solution. If v*(Q) < oo, then, for every ¢ > 0, there is a cover 
{An} € €(2*) with 


u(Q a) <ui( Un) < Tre sHe(An) < v*(Q) te. 


n=1 


This yields the desired inequality. If v*(Q) = oo, then the inequality 
u*(Q) < v*(Q) holds immediately. O 
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2.2 Let (G,y) be a formatter of the outer measure j.* defined by 
(2.8). Show that if 7 is an elementary content and G is a semi- 
ring, then for each G € G, there is a cover {C,,} of G such that 
G= pares and u(G) = Ge: 


Solution. Take G € G. Thus @% #4 @ and 
w(G) = inf( 1 WGn): {Gn} € GG}. 


Let {A,} © @. Then, by Proposition 1.3(iii) and Corollary 1.4(7) 
there is a countable decomposition }*~~_,C;, of G such that 


n=1 
ey (6.9) = Ae): 
Hence, 
yin (Cn) = inf WG): {Gn} © Set = w(G). O 


2.6 Let yu* be an outer measure on P((Q2) and {A,,} a sequence of 
disjoint j:*-measurable sets. Show that for any Q C Q, 


(Qe An) = Ve (QN An). 
Solution. By subadditivity, 
(QQ 0 Ver An) = wn @N An) < DQ 2 An). 


So, we need to show the inverse inequality. Equation (2.3g) in 
Theorem 2.3 applied to a set Q C Q and {A,,} reads 


u*(Q) > Rw (Q Ag) + wt [ON (OR An)’. 
Replacing Q by QN Y>~~., An in the last inequality we have 


w(Q Yna1An) 
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> Dee (29 (Sri dn) VAs ) 


= A; 


+ H(Q0 QaAn) 1 Opi An) ) 
=0 


which leads to the desired inverse inequality. O 


2.8 Show that »’ defined in Definition 2.5(77) is a o-algebra, fi is a 
measure, that this extension does not depend upon representa- 
tions of sets of 3’, and that (Q, 2’, 72) is complete. 


Solution. 


(i) We show that ¥’ is a o-algebra. Firstly, ©’ C %’. This is obvious, 
because if A € 7, then A= AU @=A€E YD. Inparticular,Q € 2». 


Next, let A € ¥’. Then, A= AU M, where A € X and M is a sub- 
set of some ju-null set, say N. Hence, 


A‘ = A°\M = (A°\N) + A°N (N\M). 


This representation of A° shows that it is a union of a measurable set 
A‘\N and a negligible set AM (N\M). Thus, A® € X. 


Finally, Let {A,} C ©’, where A, = A, U M,, such that A, € ¥ 
and M,, is negligible, n = 1,2,... . Then, 


a= (Gm) u(Om) 


Obviously, the set |) M,, is negligible, which makes the last repre- 


n=1 


oo oe — 

sentation of () A, »'-measurable. 
n=1 

(ii) Let 


[A]: ={AUM: Ae Sand M'sare all negligible sets}. 
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Then, by the definition of 77 [see Definition 2.5(i2)], 


which shows that 7 is invariant of a representation of a 5’-measur- 
able set. 


(iii) We show that for any N EN,, P(N) CX. Let MCN. 
From the definition of »’, foreach AC © = AUM € XS’. We can 
take the empty set as A to see that M € 2’. oO 


2.10 Let yu* be an outer measure on P((Q2) and N C 2 be such that 
p*(N) = 0. Show that for any subset Q CQ, w(QUN)= 
L"(Q). 


Solution. Obviously, 


W(QUN) < #Q)+ WW) = 2*(Q). 


The inverse inequality follows from the monotonicity of the outer 
measure applied to Q U N and its subset Q. O 


2.12 Let ju* be the outer measure generated by an extendible o-finite 
formatter (G,y), 2’* be the o-algebra of all ju*-measurable 
sets, and let 3’(G) be the o-algebra generated by G. Show that 
for every A* € X*, there is a set B € X(G) with B C A* and 
p(A*\B) = 0. 


Solution. Because (A*)° € * and (G,7) is o-finite, by Corollary 
2.18, there is a set B® € »’ such that B® D (A*)° and p*(B°\(A*)°) 
= 0. The assertion of the problem follows when noticing that 
Be\(A*’ = A*\B. Oo 


2.14 Let »’ be a o-algebra in 2 and let a € 2). Show that for 
{a} € 3’, the measure space (2, 3’, €,) is complete if and only 
if = P(Q). 


Solution. (i) If 3) = P(Q), then (Q, X’, €,) is obviously complete. 
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(ii) Suppose »’ is a proper subset of P(Q). We show that 
(Q, ’,€,) is incomplete. Denote 2’ = {a}° which is an element of 
»/ and it is a null set of e,. Then, €,((2’) = 0. Now, we show that 
there is a nonmeasurable subset of (2’. Suppose it is not true and that 
all subsets of 2’ are measurable. Note that 


P/M) = P(A) U{A CQ: A=A Ufa}: AC PID). 


Because by the assumption, »’ is a proper subset of P(Q), there is a 
nonmeasurable subset of Q, say P. Then, either P C 2’ or P = Q! 
U {a} where Q’ CQ’. Clearly, Q’ ¢ 2’, because otherwise P 
would. Thus, in either case, 2’ has a nonmeasurable subset. i 


2.16 Let (,27,~) be a measure space, G. = {GE X:u(G) 
< oo}, and Y= {QCN:QNGELX,VG € G,}. Define 
the set function j4., on 7, as 


_ J oQ), Qe 


Problem 1.13 claimed that j., was a measure on 2. Show 
that if the measure space (QQ, »’, ) is complete, then so is [.o. 


Solution. Let No be a [loo-negligible set. Then, there is a Y’,- 
measurable j1..-null superset N of N... This N must also belong to 
+) [because if it would not, j1.,(V) would be equal to oo], on which 
ju and [1 coincide. Consequently, N., is also a ji-negligible set and 
because (22, »’, 4) is complete, N., is in ¥7 and thus in 7’,. O 


2.18 Let 4u* be the outer measure generated by an extendible 
formatter (G, 7) on a nonempty set 0, * be the o-algebra of 
all j.*-measurable sets, and 4’(G) be the o-algebra generated 
by G. Show that a subset N C 2) is negligible if and only if 
p(y) = 0. 


Solution. Denote ju) = Ress» pi" and pp = Ressyg){o. 
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(a) Suppose N is negligible. Then, there is a ju-null superset of NV, 
say No. Consequently, ju*(No) = 0. By monotonicity of j1*, 


ein) =a. 


(b) Let N CQ be such that p*(N) = 0. First, we show that N € 
*, Let Q C Q. Then, by (2.3) and due to u*(QN N) = 0, 


w(Q) S w(QN.N) + w(QN.N®) = w(QNN®) < w*(Q), 


which yields 
(QQ) = w(Q0 N*) 


and hence, 


w(Q) = Ww(QNN)+ pw (QN N°). 


That N € X”* yields pg) = 0 and, in particular, p5(V) < co. By 
Lemma 2.17, there is an No > N such that No € »'(G) and 


Lo(No\N) = 0. 
The latter leads to 
LUCNo) = Ho(No) = po) = 0 


and thus implies that N is negligible. 


O 


3.1 Let H={e = (a1,...,¢,) € R": 2; =a € R} be a hyper- 
plane orthogonal to the ith coordinate axis. Show that H is a 
A-null Borel set. [Hint: 1) Show that H is closed in (R", 7) 
and hence Borel; 2) find a relevant countable cover of H by 


rectangles from S and apply Lemma 3.6.] 


Solution. Clearly, H° is open. Indeed, let x € IR” such that the 7th 
coordinate 7; = b > a. Then, the distance from x to A is exactly 
b—a. Thus, if B(x,r) is an open ball with radius r < b—a, 


B(a,r) C H°. The same holds true with a point y € 


R” whose ith 


coordinate y; = b < a. Thus H is closed and consequently, H is 
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Borel. Let Ri: = (ax, 0%] € R" be a half-open rectangle located 
upon #7 such that all but its 7th coordinate are — k for a; and k for 
(;. The ith left coordinate of A; is a and the right coordinate of R; 
is a + €% such that e, = 2-*(2k)'"e, e > 0. 


A 
xX; 
Rea a a 
| | a | H 
-k-1 k+l 


Obviously, H C R: = U R,. We now calculate A" (R): 
k=1 


n(n) = (TL - (=) 


x (a+ 27*(2k)*"e — a) = 2-*e 
=> 
A"(H) < A"(R) S DipsA" (Re) =e. 


Thus, by Lemma 3.6, 


3.6 Lemma. 4 set N CR” is A-negligible if and 
only if for each € > 0 there is a countable cover by half- 
open intervals {I;,} C S of N such that \\—_,A°(In) < €. 


634 CHAPTER 7. SOLUTIONS TO SELECTED PROBLEMS 
\"(H) = 0. Oo 


3.3 Show that f defined by (3.5) in Remark 3.5(772) is an extended 
distribution function. 


Solution. We assume that R, C C, n= 1,2,..., where C’ is a com- 
pact set in (R, 7). Recall that any “figure” R,, € R is a finite union 
of disjoint intervals of type (a, b]. Because of the right-continuity of 
f, it can be easily shown that, for each fixed ¢ > 0 for any figure 
R,,, there is a subfigure B,, C R,, such that B,, C R,, and such that 


fus(Rn) — wp( Bn) < €27". 


It also follows that (]} B,, = @. We claim that there is an r such that 


n=1 


: 
the intersection (] B, = ©. To see this, observe that 
k= 


{C\Bn = CN (Bn); N =1,2,...} 


is an open cover of C' in the relative topology (C, T MC). Because 
compactness is weakly hereditary and C’ is closed, it follows that C’ 
is also compact in 7, 9 C’. Thus, the above cover reduces to a finite 
subcover, for example, C\B,,...,C\B, yielding 


co =U (C\B,) = c\( A Br). 
k=1 k=1 
Thus, 
(1) Be=@ and (1) BR =@. 
k=1 k=1 


Now, for all n > r, 
(|) Be =@ 
k=1 


and 
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Lp (Rn) = wp Rn\ ia Br) + (1) Bx) 
c=] =1 


= by (F0\ a Bi). 
k=1 
In as much as {R,,} is monotone decreasing it follows that 


RAC A Bi) = Uca\Bi) ¢ U (\ Be) 


Observe that this is the desired inclusion implying the estimate 
u¢(Rn) < €. The above inclusion was due to the inclusion R,,\ By, 
C R;\By which holds for all k < n (as long as n < oo). Hence the 
above countable intersection reduces to a finite intersection of the 
sets By, k=1,...,r. Thus we have 


< mee u\Bi — Dnyj(Ra\ Be) 


k=1 
<e(1—2-") 
which shows that ju7(R,) — 0. O 


3.4 Let f; and fz be two extended distribution functions and let j1, 
and jig be the corresponding Borel - Lebesgue - Stieltjes mea- 
sures induced by these functions. Show that ju; = [2 if and 
only if f, — fo =, where c is a constant function. 


Solution. (i) Suppose f, and f 2 differ in a constant. Define 
uf.((a, b]) = fi(b) — fi(a). Therefore wy, = poy, on semi-ring S and 
thus obviously on the ring R(S). As Lebesgue - Stieltjes contents, 
they have all conditions met for a unique Carathéodory extension of 
the corresponding formatters from the ring 7(S) in R to the Borel o- 
algebra B(IR). See also Remark 3.5(7-7i7). 
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(ii) Conversely, if ju; and pg are two Borel - Lebesgue - Stieltjes 
measures induced by two extended distribution functions f; and fo, 
such that 4, and slg coincide on semi-ring S, then we have for a 
fixed a and any x > a, 


i ((a,2]) = filz) — fo(a) 
p2((a, z]) = fo(x) — fala). 


Therefore, f(x) = fo(x) +c, where c = fi (a) — fo(a). 


For 7 <a, 


pa ((w,a]) = fila) — fila) 


and 
Hi2((a,a]) = fo(a) — fa(z). 
So, 
— fila) + filz) + fola) — fala) =0 
and 
fil) = f(a) +e, 
Fe Cen Oo 


3.9 Let f be a distribution function and let uf denote the Borel - 
Lebesgue - Stieltjes measure induced by /. Justify with a proof 
or give a counter-argument: 


a) Must f be an extended distribution function? 


b) Suppose g is a function defined by (*) of Remark 3.5 
(277). Is g a distribution function? If your answer is “yes,” 
sys 7? 


Solution. In Remark 3.5(zi7), definition (3.5) of an extended 
distribution function may look confusing, primarily because we set 
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f (0) = 0. However, we recall that (3.5) defines only one repre- 
sentative of the entire class of distribution functions. The rest of 
them can be restored by adding to f the set R of all real constants. 
Thus, 9, C D, even though f(0) need not be equal to 0. Further- 
more, if we employ (3.5) by means of ju, then the obtained function 
this way is an extended distribution function, say h, which is equal to 
zero at the origin, but it need not be a distribution function, as it need 
not vanish at —oo. However, f and h differ in a constant. 


a) Therefore, the answer to a) is “yes.” 


b) As pointed out, g need not be a distribution function. How- 
ever, if it is one, it must necessarily equal /. O 


3.12 Let (Q,B, ) be a Borel measure space, such that the Borel o- 
algebra B is generated by a Hausdorff topological space 7, and 
j is a finite Borel measure on B. For any subset @ C (, denote 
by K(Q) the collection of all compact subsets of Q. Show that 
the family M C B of all Borel sets B such that 


y(B) = sup{y(K): K € K(B)} 


is a monotone system in (2, that is, all Borel sets that can be 
approximated from below by compact Borel subsets form a 
monotone system. 


Solution. We need to oy that for each monotone increasing se- 
quence {M,,}7 CM, U M,, € M, and for each monotone de- 


n=1 


creasing sequence {M/,,}| CM, Q M, € M. 


n=1 


a Let {M,}1 CM. Then V ¢/2 > 0 and Vn = 1,2,..., there is 
€ K(M,) : 


u(Mn) < (Kn (€)) + €/2. 
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k=l 
Ky(e) © Cale) © M,,, which yields 


(Mn) < w(Cr(e)) + €/2. 


By continuity from below (Lemma 1.6) we have 


< Jim, u(Cn(e)) + €/2 = w(C(e)) + €/2, 


where C' = [J C,,. Because {C,,(€)} is a monotone increasing se- 
n=1 

quence of compacts subsets of M/, for any ¢/2 there is an N EN 

such that, Vn > N, p(C(e)) < w(Cr(e)) + €/2, what finally yields 


w(M) < w(Cr(e)) +e. 


In other words, Ve > 0 there is a compact subset of M satisfying the 
above inequality. This is equivalent to 


p(M) = sup{u(K) : K € K(M)} 
> Mem. 


2) Let {Mn} = M. Then Ve > 0Vn = Vy 2y aes 
4 K,(€) € K(Mn) : u(M,,) = p( Kn (€)) + ei ‘ 


Let K(e): ={)K,,(e). According to Problem 6.10, Chapter 3, any 
n=1 


intersection of compact subsets in a Hausdorff topological space is 
compact. Thus, A(<) is also compact and in addition, K(e) C M. 
Because yu is finite, (M\K (e)) = w(M) — w(K (e)). On the other 
hand, 
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m\K(e) = | Aan|\| AK) 


n=1 


=f) Gaile). (P3.12) 
n=1i=1 
Now, if {A; : 7 € I} and {B;: 7 € J} are two indexed families of 
sets, we obviously have: 


Uy Leave) | ea ol 
= (ua) {ne} =n [eo (94)| 
=f) U(Ain Bi). (P3.12a) 


jeJ ier 


Consequently, due to (P3.12a), equation (P3.12) yields 


M\K(e) =) UM Ke) =f) Ua\ Kile) 
= UN(MAK(2)). (P3,12b) 


We show that 
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M\K(e) © U (My \K;(e)). (P3.12c) 
k=1 


Indeed, if M\K(€) 4 O (which otherwise would be trivial), then, 
due to (P3.12b), for any w € M\K(e), there is an ig such that w € 


() (M,\ i, (e)). Therefore, w € M; 


n=1 


of  (M\K;(e)) herewith validating inclusion (P3.12c). Conse- 
k=1 


»\i,, Which in turn is a subset 


quently, we have 


k=1 
— (Mr) — w(Ke(e)) Se 
=1 
and hence, M € M. 
So we proved that M is indeed a monotone system. O 


3.13 Let (R”, B(R”), 1) be a finite Borel measure space (that is, ju 
is finite). Show that js is weakly regular, that is, it is outer and 
closed regular on B(R"). 


Solution. Let W C B be the family of all weakly regular Borel sets. 
That is [see Definition 3.12 (iv)] for A € W, 


p(A) = inf{u(O): ACOE x} (P3.13) 
and 

(A) = sup{u(F): AD Fe ro}. (P3.13a) 
Weak regularity seems to relate to some, but surely not to all, Borel 


sets. However, we prove that under the above assumptions, WV = 
B(R"). It can be easily shown that (P3.13 - P3.13a) is equivalent to 
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the following assertion. Given an € > 0, there is an open set O and a 
closed set F’ such that 


FCACOand p(O*1KF) <e. (P3.13b) 
(i) Obviously, from the definition, open sets are outer regular. 


(it) Let F, be a countable union of closed sets. By Problem 1.26, 
Chapter 3, each open set in 7, is an F7, set. That is, 
T DO= U Fy. 
k 


=1 


By Problem 1.18, Chapter 3, we can even assume that {F),} is a 
monotone nondecreasing sequence from 7{ (which is obvious any- 
way). Then, by continuity from below, 


w(O) = limgo U(F,) = sup{u(F’) : 7S 3 F CO} 


that implies that 7. C VW, 1.e. open sets can be approximated by open 
sets and by closed sets (a weaker form of the closed regularity). 


(iii) We show that W is a o-algebra. 


a) R" € W because it is open. 


b) Let Q€ W and an ¢>0 be chosen. Then, according to 
(P3.13b), there are open and closed sets, say O and F’, respectively, 
with 


FCQCOand p(O*\/F) <e. 
Hence, 
TeOCOCr eR, 


and 


WP \O°) = w(R") — w(F) — p(R") + (0) <e 


implying that Q° € W. 
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c) Let {Q,} C Wand e > 0 be chosen. Then, for each k, there is 
a couple of sets, Fy, € Tf and O; € 7, such that 


Fy © Qe © Of; and u(Ox\ Fr) < é/2*. 


Co 


Let O = [J O; and F, = UU Fy. Then, because of 
k=1 k=1 


w(O\F,) < »( Uo.\ri] < 35% u(Og\Fe) < €. (P3130) 


Although F’, is not closed, it is an F, set itself, and as such it can be 
represented as the countable union of a monotone nondecreasing se- 
quence, say {C;,}, of closed sets. In fact, this sequence can be chosen 


k 
as 4 LJ F; 7. By continuity from below we then have 
j=l 


k 
BFS) = limp oops ( U ns) 
jel 


implying that for the same ¢ > 0 chosen earlier, there is an N such 
that Vn > N, 


u(F,) — (0) <e. (P3.13d) 
jel 


Therefore, from (P3.13c) and (P3.13d) we have 


1 (os) = u(0)- 1 ( un) 
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= w(O) — wFo) + w(Fo) — ( U ns) < 2¢, 


j=l 
with 
k ee) lee) ee) 
Ta) Clr H= FC) eee Ope es, 
j=l jel k=1 k=1 
concluding that (J Q; € W and that W is indeed a o-algebra. 
k=1 


(iv) Because W is a o-algebra and 7. C W, it follows that B C W 
and thus W = B. O 


3.18 Show that a o-finite Borel measure js on (IR, B(R)) need not 
be outer regular. 


Solution. Let 1 be defined as follows. Let B: = {1,5,...} and V 

n=1,2,..., w({4}) =+, while for every subset A of B® we let 

(A) =0. If Ba: = {1,§,-..,+} then obviously (B, UB) is 

finite and B, UB°t R=. Although u(R) = u(B) = > 4 = 00, 
n=l 


js 1s finite on B, UBS => wp is o-finite. We show that Borel sets 
cannot be ju-approximated by open supersets in general. Let us take 
set {0}. If O is any open set containing 0, then O must also contain 
an open interval centered at zero and thus, for some € > 0, 


pw(O) > p((—¢,+e)) > u( s :) = oo (for some k). 


Thus, (O) = co holding for any open set containing set {0}, 
whereas ju({0}) = 0. O 


3.19 Let (R",B(R"), A) be the Borel-Lebesgue measure space. 
Show that the Borel - Lebesgue measure is strongly regular. 


Solution. From Definition 3.12(iv-v), we need to show that is 
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(A) finite on K(IR”) (which is obvious, because in (IR", 7), compact 
sets are d.-bounded and 4 is a Borel - Lebesgue - Stieltjes measure) 


(B) inner regular (it is done as per Problem 3.15, because \ is a o- 
finite Borel measure) 


(C’) outer regular, i.e., for each Borel set A € B(R"), 


\(A) = inf{u(O): A CO € Te}. (P3.19) 


The latter is what we are going to work on in this problem. Firstly 
assume that, given an A € B, \(A) < 00, because otherwise, the re- 
sult is trivial considering monotonicity of A. 


We need to show that for each ¢ > 0, there is an open set G € T, 
such that 


NA) +e> AC), (P3.19a) 


By Carathéodory's extension, A can be approximated by half-open 
rectangles. More precisely, if 4 = €4(S") is a the set of all at 
most countable covers of A by half-open rectangles (i.e., elements of 
the semi-ring S”), then 


MA) = inf{TI A(R.) + {Rp} © GA J. 
Therefore, for each « > 0, there is a cover, say {R;,} € G4, such that 
A(A) + eh > 322° A(Rp). (P3.19b) 


In turn, given ce, for each R; there is an open superset, in fact a 
rectangle, say G,, such that 


N(R) -e/2 = Gy: (P3.19c) 


If G = UG, then G is an open superset of A. It remains to show 
k=1 
that G has the required property (P3.19a). Indeed, from (P3.19b-c) 


A(A) + § > DRAG) — § = A(G) - § 
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which holds due to o-subadditivity of A. O 


3.21 Show that if N is a negligible set in (R",6,A), for each 
€ > 0, there is a countable cover of N by disjoint half-open 
cubes {C;,} such that 


yi (Ce) SE; 


Solution. By Definition 2.5(2), if N is negligible, there is a Borel A- 
null superset, say No, of N, that is, No € BUR”) such that No > N 
and A(Nj)=0. By Problem 3.20, given an ¢ > 0, there is a 
countable cover of No by disjoint half-open cubes {C;,}, such that 


Cs Oo 


3.24 Let y be a translation-invariant Borel measure on G(R”) and 
let ju" be the outer measure produced by (B(R"), j) and Br be 
the corresponding o-algebra of j:*-measurable sets. Show that 
(@) =p = W(C)A* on P(R") and fg = W(C)Ag on Bi, where 

C is a unit cube. 


(17) By = £*. 
Solution. 


(¢) By Theorem 3.10, w = w(C)A on BCR"). Let pi be the outer 
measure produced by pw(C)A and 5 = u(C')A*. Because p*, jj, and 
jt, coincide on B(IR”), by Remark 3.3(2), u* = wit = 5 on PCR"). In 
particular, 


Ly = WCAG on Bi. 


(22) In light of (7), we take p(C’)A* as a version of u*. Let Q € 
R"). Now, the equation 


x 


WQ) = wQN B) + wX(QN BY) 
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holds true if and only if 
MQ) = M(QN BY + V(QN BY) 


does. Hence B € B/ if and only if BE L*. O 


3.28 Show that there is a non-Borel set in P(R). 


Hints: We call x,y € R equivalent (x ~ y) if and only if 
x — y € Q (rational numbers). For every real number x € R 
we assign another real number y to the class A, if and only if 
z—-yEeEQ. 


1) Show that (IR, ~ ) is indeed an equivalence relation. 


Let €|. be the quotient set of modulo ~ . Using the axiom of 
choice we select any element from each class of €|. that 
belongs to set (0,1]. Denote by A the collection of all such 
elements. 


2) Show that such a selection is possible taking into account 
the axiom of choice; that is, it can be shown that Vx € R, 


A, (0,1] 4 @. 
3) Show that set A has the following properties. 
Va#rEQ, {q+ A} n{r+A}=2. (P3.28) 


R can be restored from A as 


R= U(¢qt+A)= “Xi(qt+A). (P3.28a) 
qeQ qEQ 


4) Finally, let @ = QN (0, 1]. Then U (q+ A) C (0, 2]. If A 
qeO 

is a Borel set (and this is the assumption that will lead to a 

contradiction), then by Problem 3.11, z+ A is a Borel set 

too; and by the translation-invariance of Borel - Lebesgue 

measure \, A(x + A) = A(A) implying that 
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4.3 


\(U (a+ 4)) = EAG+ A) < (2) =2. 


qeQ qeQ 


Thus the above series is finite; and because the A(q+ A) 
values are equal for all g € Q, each of them must be zero, 
which implies that \(q + A) = 0, Vq € Q. But 


R= (q+ A) > A(R) = LAG+ A) =0, 
gEQ qEQ 


which is an absurdity. Thus, our assumption that A is a Borel 
set was wrong. 


Let (Q, 2’, w) be a measure space with =R, Xs = {ACR: 
either A or A° <N} and let p(A) =0 for A=<N and 
pA) =1 for A°<N, Let O=R, 2 = B(R),. Define 
[Q, 0’, f] as 


0, if x is rational 
f(z) = 1g-(x) = { 1, if z is irrational. 


Prove that f is 3/-»7’-measurable and determine pi f*. 


Solution. (i) Let B be any Borel subset of R. Then, f*(B) € 


nS 


R, O, Q, Q‘}. The latter is a o-algebra which clearly is a subset of 


>. Thus, f is measurable. 


(i7) pw f*(B) = a *({O}) = uw(Q) =0, because Q is countable. 


uP Lh) = o(Q 


1, because Q° is uncountable. Then, 


uf*({0,1}) =O+1=1. 


Now, let B be any Borel subset of R. From (7) we have that 
f*(B) € {R, G, Q, Q*}. Hence, wf*(B) € {0,1}. Oo 


4.5 


Let M(R") be the collection of all motions on (R", d.). Show 
that (M(R"), 0), where o is the composition operator, 
forms a group. 
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Solution. We define a motion as M=T,08R, where FR is an 
orthogonal matrix and T,,(2) = «+a, x,a € R”. [To keep consist- 
ency in notation we will write A(x) instead of Az, if A is ann x n 
matrix. | 


(i) Let b € R” and consider 
Ro T(r) = R(x + b) = R(x) + Rib) 
= T, 0 R(z), (P4.5) 
where a = R(b). Hence, b = R~!(a). In conclusion, 
TeRohe. (P4.5a) 


(it) We show that MM, o Mp is a motion. Then M, = T, oU and 
M2 = T, 0 V, where U and V are othogonal matrices. Thus, 


Mo My = (T, 0U) 0 (Tho V) 


=Uo Ty-1(a) oT,0V (P4.5b) 
= 17 0 Ty-1(a)-+b eV (P4.5c) 
= Ty-1(U-1(a)+b) oUo V, (P4.5d) 


which is a motion, because U o V = UV is an orthogonal matrix. 


(iti) We can represent the unity motion as a composition of the 
unity matrix I (which is orthogonal) and translation To. 


(iv) We show that for any motion, say M, there is a motion M) 
such that M,o My =T. Indeed, let in (P4.5c) U~!(a)+b=0 
[implying that b = — U~!(a)] and V = U'. Then, from (P4.5c), 


M,oMyz=UoTjoV= TI, 
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if My i U-1(a) ° ot. O 


4.6 Let f be a homothetic function (f(x) =azx) defined in 
Proposition 4.3, part 1, \ be the Borel - Lebesgue measure on 
BIR"), A* be the Lebesgue outer measure, £L* be the c-algebra 
of Lebesgue measurable sets, and \j be the Lebesgue measure 
on £*. Let yu* be the outer measure generated by the image 
measure Af*, 6’ be the o-algebra of all j.*-measurable sets, 
and ju) = Resg; j". Show that: 


a) p* = jal "A* on P(R"). 
b) 15 = a "dj on B*. 
c) 5, = 


Solution. a-b) Let pu} be the outer measure generated by Borel 
measure Af* and j15 be the outer measure poe by Borel 
measure me Then, the outer measures ju}, v5, and A* coincide 


on B. Indeed, ju; = 4; on B and ju, = jg. In addition, 


iar 


pe = pA = pra” = 45 on B,asd = * on B. 


According to Remark 3.2(7) [Theorem 3.1 states that the Lebesgue 
content on R(S) in R” is o-additive. This, obviously, implies that 
the Lebesgue elementary content is also c-additive on S.], 


pt = pt = JA* on P(R*) 


= lal” 


and their restrictions 
* ok 
Hig = Hao = ar Xo on B". 


c) Denote p* = = apr’, which by (a-b), is also the outer measure 
produced by (6, Af*). Then, for each Q € P(R"), the equation 


WQ) = w(QN B) + wX(QN BY) 
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holds true if and only if 

MQ) = M(QN BY + M(QN BY) 
does. Therefore, B € 6* if and only if B € L*. O 


4.8 Show that the Lebesgue measure Aj on L* is translation-in- 
variant. 


Solution. 


(i) First we need to show that for each £*-measurable set A, 
A+aze€L". 


From Remarks and Definitions 3.2(22) [or more specifically, from 
Theorem 2.19(27)], (R", £*, AG) is complete and coincides with the 
completion of (IR", B, A). Hence, each A € L* can be represented as 
A= BUM, where B € B and M is negligible and, consequently, 


A+a2=(B+2)U(M+2) 


To prove that A+ 2 € L* it is sufficient to show that M+ 2 is 
negligible, as by Problem 3.10, B + x € B. Because M is negligible, 
there is a Borel measurable superset, say N, such that A(V) = 0. 
Clearly, N + x is again Borel and because Lebesgue-Borel measure 
A 1s translation-invariant (Proposition 4.3), ACV +2)=0. 
Obviously, N + x is a superset of M + x. So, we are done with (2). 


(ii) If \ is the completion of A, then \(A + x) = A(B + 2) and by 
the translation-invariance of A, the latter is equal to \(B). Because 


= Aj, we have that 


Ni(A + 2) = AB) = \5(BU M) = A8(A). oO 


4.10 Let (Q,2°,P) be a probability space and X; be a (2, ¥’)- 
(Q;, 27;) r.v., 2 = 1,...,. Show that X1,...,X;,, are indepen- 
dent if and only if o-algebras 4’;'s have M -stable generators G; 
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such that the system {Xj*(G1),...,X7*(Gn)} of families of 
events is independent. 


Solution. (i) Suppose {X7*(Gi),...,X7*(Gn)} are independent. 
Then, if G; is a generator of ©;, X}*(G;) is a generator of X}*(%j). 
Indeed, by Problem 3.3, Chapter 4, if f :Q — ’ is a function and 
G’ Cc P(’), then 


FP (LG) = BF" (G")), 
or in the form 
3(XF*(Gi)) = XP (Ai) = XP(L(Gi)). 


Furthermore, it is easily seen that with G;'s being ™-stable, so are 
X;*(G;). Thus, by Problem 1.22, the system of o-algebras 
{X7*(L1),...,X**(L,)}is independent and so are the r.v.'s 
Xj ,...,Xn. We only notice that Problem 1.22 required that the 
generators G;'s be semi-rings, but in the proof of Problem 1.22 - 
stability was all we needed. 


(ii) The converse is trivial. If X,,...,X, are independent, then the 
o-algebras {X7*(21),..., X7*(2'n,)} can serve as M-stable gene- 
rators of themselves. O 


5.4 Show that if f? € C~!, then f € C~!. Show that if f? € C7! 
then f need not be in C"!. 


Solution. (i) f° is measurable if and only if {f* < a} for all real 
values of a. But then, {f? < a} = {f < a!/*} if we consider only 
real values of a. 


(it) Let A be a nonmeasurable set and f = 14 — 1,4c. Then, f is a 
nonmeasurable function considering that f :Q— {1, — 1}. How- 
ever, f? = 14 +14- —0 =1 is a measurable function with respect 
to the trivial o-algebra {0 O}. O 
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5.8 Show that if f has a derivative at each point of R, then this 
derivative is Borel-measurable. 


Solution. If f has a derivative on R, then for each x, 


f'(o) = im Leet, 


n+ CO 
where {h,,} is a sequence convergent to zero. Denoting 


x hn = y 
f(x) = f(at W) f(x) 


we see that for each n, f,, is Borel-measurable. This is because f is 
differentiable and therefore continuous. Now the statement is due to 
Proposition 5.6(v7). C 


5.9 Show that the space (C~!, rp) is Hausdorff. 


Solution. First note that C~! is a subset of R° and because (R, 7.) is 


R°, rp) is Hausdorff, as 


Hausdorff, the Tychonov product space 


nos 


we know from Problem 5.17, Chapter 3. The statement follows from 
Problem 6.12, Chapter 3, according to which Hausdorffness is 
hereditary. O 


5.10 Let (9, 4',) be a complete measure space and let f € 


C-1(Q, X; IR). Suppose that g: Q — R is an extended, real- 
valued function. Show that if g= f (mod yu), then g€ 


C(O, ©; R). 


Solution. The assumption g = f (mod jz) implies that N = {f 4 g} 
is a negligible set. Because the measure space (Q, »’, z) is complete, 
both N and N° € »’. Obviously, for any Borel set B, 


g(B)N Ne = f*(B)NN-. (P5.10) 
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Because JN is negligible, so is g*(B) MN as a subset of N. Because 
(Q, 5’, 4) is complete, g*(B)MN is measurable. From (P5.10) it 
follows that g*(B) ™ N° is also measurable. Hence, 


g (B) = [9*(B) NN] U [9(B*) N N*] 


is measurable. Therefore, g**(B(R)) C © implying that g € 
C10, 2). O 


§.11 Let (,27',42) be a complete measure space and _ let 
{fr} CC1(Q, Y;R). Suppose that lim, ... fn exists and 
fn — f pointwise pi-a.e. on 2, where f is an extended, real- 
valued function. Show that f € C7!. 


Solution. By Proposition 5.6(v2), 
um, fn € COS), 


Because _ lim. fn = f (mod j1), due to Problem 5.10, we have f € 
meal (9 O 


5.12 Let Q be a non-Borel subset of R (such as one in Problem 
3.16) and let C’ denote the Cantor ternary set. Define the 
function 


f(a) = lgno(@)sinx + 1gney(a)2”. 


Is f Lebesgue measurable, that is, f € C~'(R, L*, Aj)? 


Solution. Because QM C is negligible, f(x) = x? (mod X). Because 
the Lebesgue measure space (R, £*, Aj) is complete, by Problem 
5.10, f is Lebesgue-measurable. O 


5.13. We knew from probability theory that r.v.'s X),...,X, ona 
probability space ({2, X’,P) and valued in R (each), endowed 


with the Borel o-algebra B(IR), are independent if and only if 
their joint probability distribution function (commonly abbre- 
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duct [] P{X; < x;} of their marginal PDF's. Prove that this is 
i=1 
true using the original definition in Example 4.5 (777). 


viated as the PDF) P{X1 < 21,..., Xn < %,} equals the pro- 


Solution. (1) Suppose 


PY XS Big cig Ay oy} = [Pe = a} 
=i 


holds true. From Proposition 5.2, we learned that the collection of 
sets G = {[—co, a] : a € R} is a generator of the Borel o-algebra 
B(R). Clearly, G is M-stable. Therefore, by Problem 4.10, Xy,..., 
X,, are independent if and only if ©; = B(IR) has an M -stable gene- 
rator such that the system {Xj*(Gj),...,X7"(G,,)} of families of 
events (with G; = G) is independent. The latter means (see also Prob- 
lem 4.11) that 


P {X1 e [—co, x1] } sell 4X € [ —00, tn] } 


=] P{X, € [—00, 2i]} 


is a sufficient condition that the system of o-algebras { X}*(21),..., 
X;**(57,,)} is independent and so are the r.v.'s X1,..., Xn- 


(ii) The converse is due to Problem 4.10. O 


6.2 Let 2 be an uncountable set and let = {A CQ: A or A is 
at most countable}. Show that f € C~1(Q, 2’) if and only if f 
is constant everywhere except on an at most countable subset 
of 22. 


Solution. (i) Let f be measurable. 


Case (a) f = 1,4. Then, A € » and thus either A or A° = N. Either 
way, f meets the assertion. 
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Case (b) f = >>;",ci14,. Then, at least one of the A,;'s is uncount- 
able on which f is a constant. 


Case (c) f € C;'. Then, there is a monotone sequence {g,,} of 
simple functions such that f = sup g,. Suppose B,, is an uncount- 
able set on which g,, = b,. Obviously, {b,,} is a monotone non- 
decreasing sequence whose nia equals f on some set. We claim 


that surely f(w) = sup b,, on ia) B,,. Indeed, suppose we have 


91 < go and g; = b; on B, and go = by on Bo. 
Clearly, 

g2(w) = gi(w) V go(w) = bp on By NM By. 
Now, obviously, 


(B) =U mn 


n=1 n=1 


Thus, B: ={) B,, is uncountable. Hence, f = b = sup b, on B. 
n=1 


Case (d) fj = f* —f . Suppose f* =b on B and jf =c on C. 
Then, because B® UC <N, we have that BMC is uncountable 
and thus, f =b—con BNC. 


(ii) The converse. Let f = c on some A such that A is uncountable 
and A° = N. Consider the set 


{f>ch={f=c}+{f>c}=A4B. 
Obviously, 
{f<ch=A°N BR SN. 


Thus, {f > c} € and so is {f < c}. Leta < c. Then, 
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{f<e) Cty <c} SN, 


whereas 


{f 2c} C{f 2 a} 


is uncountable. Thus, for all a<c, {f <a} and {f >a} are 
measurable. It is easy to see that these sets are measurable for a > c 
as well. O 


6.5 Theorem 6.4 states that for each f € C;', there is a monotone 
nondecreasing sequence {s,,} of nonnegative simple functions 
(from W,) such that sup{s,,} = f. In Remark 6.5, it is shown 
that unless C;' is first countable in the sense of Tp (which in 
most applications it is not), C;' is a proper subset of W, (the 
Tp-closure of W,). Does it mean that the converse of Theorem 
6.4 does not hold [i.e., that, although sup{s,,} € YW, of some 
sequence {s,,} of simple functions, it need not belong to C;,']? 
Analyze and explain this. 


Solution. By Theorem 5.6(vi), C~' is closed with respect to all 
pointwise limits. In other words, (C~', rp) contains all of its sequen- 
tial limit points, as does the relative topology (Cj',tpNC;'). 
Therefore, all monotone nondecreasing sequences of simple 
functions (which are clearly pointwise convergent) will have their 
limits in C{!. 


However, (Cz',tp MC ;') does not contain its closure points, 
because as argued in Remark 6.4, for most cases, it remains a proper 
subset of W,. A topological space, as we learned from Remark 1.15 
and Examples 1.16, can contain all of its sequential limit points and 
yet not being closed. In conclusion, Theorem 6.4 can be extended by 
the inclusion of the phrase: “Conversely, every limit of a monotone 
nondecreasing sequence of simple functions belongs to C;'.” O 
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Chapter 6. Integration in Abstract Spaces 


1.3 Let (, 27,4) be a measure space and c > 0. Show that for 
each f € L'(Q, X, pw; R), 


[fd(cw) = cf fap. 
Solution. First observe that cj is a measure on (2, 4’). Let 


f = Wpareela, € ¥(O, Z). 
Then, 


J fd(cp) = Dh sancn(An) = ef fap. 


Let f €C;'. Then, there is {s,}? CW,(Q, 2) such that f = 
sups,,. Hence, 


J fa(cu) = sup [s.d(cy) = sup{e [sndu} = cf fd. 


Finally, for f € C~'(Q, X), the proof is routine. oO 


15 Let f €C1(Q, 2, u;R) and suppose that there are two non- 
negative integrable functions wu and v such that f= u-—v. 


Show that f € £1(Q, 3’, y; IR) and that 
Sfdp= Judu— fod. 


Solution. (i) Let f = ft — f~. Then, 
ft-f( s=u-vas fttveutf. (P1.5) 
Furthermore, 


f=u-vsusutva ch 
and 


—f=v—-us<sv<utv=h 
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Then, from 
-hef<hslfi<h 


and because by Proposition 1.3(i7) h is integrable, so is |f|, by Pro- 
position 1.3(iii). The latter means that f € L?. 


(ii) Now, because by Proposition 1.8, Lis a semivector space, we 
have: 


S(f* +u)du= fftdut fodu 
Sut f)dp= fudu+ f fo du. 


Because of (P1.5), the left-hand sides of the last two equations are 
identical; so are the right-hand sides and thus we get 


Sfdu= ffrdu—ff-du= fudp— fodp. Oo 


1.7. Show that (£1(Q, ©, w;R),R, +, -, X ) isa Riesz space. 


Solution. Because in light of Problem 1.6, (£1(Q, 2’, u;R),R, +, 
-, <) is a partially ordered vector space [see Definition 7.7(i)], it 
remains to show that f,g € L' implies that f Vg € L' and f Age 
L’. Clearly, 


lf Val 3 lfl+ gl and |f Ag] X |f|+\Ial 


that yields the assertion. O 


1.9 Let (9,2; 41) be a measure space and let f ¢ C-1(, Y; R). 
Show that f € L'(Q,X,y;R) if and only if there exists 
g € L(Q, X, ;R) such that | f| < g. 


Solution. 
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(i) If such a gE L1(Q, 2,4) exists with |f| <g, then, by 
monotonicity of the integral (Proposition 1.12), |f| € L'(Q, ©, 2) 
and hence f € L1(Q, X’, yu), as f < |f|. 

(ii) f € L1(Q, XY, w) if and only if | f| € L1(O, X, yw), as noticed 
in Definition 1.10(i). Hence | f| can be taken as g. O 


1.11 Show that if feEC1(Q,2;R) then f € [0], yields that 
J fdu = 0. Does the converse hold true? 


Solution. By Problem 6.4, Chapter 5, f € [0], = f*,f~ € [0);. 
By Lemma 1.15, both [f*tdy and ff dy equal zero. Thus, 
J fd = 0. However, if [f*du and f f~dy are equal (such as if 
f(x) = sin x on [0,271]), then f fds: = 0, whereas f is not. O 


1.13. Generalize Proposition 1.16. Let (Q, 2’, 4) be a measure space 
and let f,g € C-1(Q, UR) such that f = g (mod 11) and f € 
(Q, 37, wR). Then 


Sfdu= fodu 


and g € L(Q, ’, 4; R). 


Solution. Suppose that f = g (mod jz) and that f € L(Q, ©, y;R). 
By Problem 6.3, Chapter 5, 


f* =g° (mod pz) and f~ = g” (mod 11). 


By Proposition 1.16 [Jf (Q, 2’, w) is a measure space and f,g € 
C'(Q, 2; R) such that f = g (mod p), then [ fdu = fgdul, 


Sfidu= fgrdu and ff-du= fg dp. Oo 


1.15 Show that for f €C71(Q,X), f fdu =0 for each Ac LY if 
A 


and only if f € [0],,.Does this assertion generalize Lemma 
iS? 
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Solution. 
(i) If f € [0],,, then for each A € XY’, Lyf € [0],,, and therefore, 
Sfdu =0 
A 


follows from Problem 1.12. 


(ii) Let f fdys = 0 for each A € »’. Denote 
A 


M={f>0} and N={f <0}. 
Clearly, M,N € 2’. By the above assumption, 
Sflmdy = J(- f)lwdy = 0. 
Because both integrands in the last equation are elements of 
Cz" (Q, 2’), by Lemma 1.15, 
flu. (— f)1n € [0]7. 
Hence, 


(MM) = WN) = uM +N) = wf # Of) = 0. 


(iti) No, this assertion does not generalize Lemma 1.15, because it 

requires that [ fds. = 0 for each A € ¥’, whereas Lemma 1.15 does 
A 

not. O 


1.17 Let 74 be a Poisson measure and let f € C;'(R, B;R). Show 
that 


Jfdte = Vrao ear F(n). 


Solution. Using Example 1.9 (iv) and Problem 1.3 we have the 


-—a a” 


validity of the assertion by setting ju, = e~° “Ten, n € No. O 
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1.20 Let js and v be two measures on (Q), >’) such that pp < v. Show 
that for each f € L'(0, XY, w;R)N L'(Q, ©,v;R), the integ- 
ral [ fd(v — yw) makes sense, f € L'(Q, Xv —u;R), and 
that 


[dv — 1) = ffdv — f fap. 


Solution. Obviously, v — j1 is a measure on » and for each A € %, it 
holds true that 


(v — p)(A) = (A) — WA). 
Let s = Sop_agla, € Y(Q, Y). Then 
Jsdy ~ p) = Vparoly — w)(Ar) 
= pn ORU(Ag) — Sp OAs), (P1.20) 
which yields the statement for integrable simple functions. Note that 
séEL'(Q,2,v—p), as follows from (P1.20), because s € 
1(Q, 1, uw) £1(Q, Xv). If f is nonnegative and {s,}t f, then 
from (P1.20), 
[fd(v — p) = sup fsndv — 1) 
= sup (af (An) — Deal Aen) 
= ffdv— ffdp. (P1.20a) 


Again, (P1.20a) yields that f € L1(Q,X7,v —). Finally, if f € 
HQ, 2, 2) O01, Dv), then f = ft — f- and 


Jfdv-w=ffrdv—-w- sfdw —-p) 
= fftdy—[f-dv—(fftdp— ff dp 


and the statement follows. O 
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1.22 Prove Proposition 1.18. Show that if f,g € C-'(Q, ©';R) and 
forge L1(Q,2,u;R), then 


ffdu= fgdu foreachAce X (P1.22) 
A A 


yields that f = g (mod 1). 


Solution. Let N = {f > g}. Then, the function (f — g)ly € C;!. 
From (P1.22) it follows that g € L'(Q,X,y) if f does. Equation 
(P1.22) and the linearity of L1(Q, X’, ) (see Problem 1.8) imply that 


S(f — g)1ndp = 0. 


By Lemma 1.15, (f — g)1n € [0]7. Similarly, (f — g)1u € [0)7, 
where M = {g > f}. Therefore, 


(G>o)lgea SU), 


from which it follows that {f Ag} is a y-null set or that 
i _—g = [0]... O 


1.25 Let g€C1(,Y,u;R,B(R)) such that fgdu>0 for all 
A 
A € 3. Show that g > 0 p-a.e. 


Solution. Let A= BM {g < 0}, B an arbitrary measurable set, and 
let h = g14,<o}. Then we have {hi1gdp > 0, whereas h < 0 on 22. 
The latter yields that [h1gdu < 0. Comparing the two inequalities 
we conclude that 


{h1gdu = 0 for every measurable set B. 


By Problem 1.15, then h = 0 (mod j1) implying that g1,,<9} = 0 p- 
a.e. It obviously implies that {g < 0} is a y-null set. Consequently, 
g > 0 pr-ae. CO 
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2.2 Generalize the monotone convergence theorem: Let {f;,}7 
Cc C1, ) and g € C71(Q, X’) such that f, > g for all n 
and suppose that [ gdju > —oo. Prove that 


sup{f fndu} = fsup{ fr} du. 


Solution. 


(i) Suppose [g < oo, which, along with [gdj. > —co means that 
g € L1(Q, X, 1). Therefore, 


neg > tp Hee, {frn—-g} CCH, 


and { f,, — g} is monotone nondecreasing. Hence, we can apply to 
{ fn — g} the monotone convergence theorem 2.1: 


sup | (fn — 9) = Jsup(fn — 9). 
The last equation yields 
sup f fn — fg = Jsupfn— fg, 
which in turn reduces to 
sup J fn = Jsup fn; 
because by the above assumption g € L'(Q, X, 1). 
(ii) Let f[g = +o. Then, from f,, > g it follows that 
ffrn>fg=t+o => supf[fr= +o. 


On the other hand, supf, > f,>9 = fsupf,>fg= +o. O 


2.4 Let {fn}| CC and g €C™! such that f, < g forall n. If fg 
djs < co, show that 


inf{ {fd} = finf{f,} dy. 
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Solution. Apply the generalized monotone convergence theorem to 
{= JahT and — @g. O 


2.6 Show thatif f,, — jf in mean then 
litt, yl foe [fap 
Solution. 
| {n- Pdu| = | ffrdu— J fde | 
< Jl fo— fl dus ian. 


Because a, — 0, for each ¢, there is an N such that for alln > N, 
Qn < €. O 


2.9 Let 


rte) ={ n, 0<a<+ 


0, 4 on ie GOOF 
Explain why 
flim oc fn Mdx) F lim yoo f fn ACA). 
Solution. (4) Clearly, 


; co, r=0 
(Sime { vo 


Therefore, f is Borel-measurable and f,, — 0 A-a.e. pointwise on 
Ri. Because f = 0 A-a.e., 


pfar=J,,Var=0. 


On the other hand, Jp, fndr = 1 forall = 120.33 4 
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(ii) Now the reason why the integrals are not equal is because we 
cannot use the Lebesgue dominated convergence theorem. The latter 
does not apply because there is obviously no dominant integrable 
function g >0 for the sequence {f,,} required in the Lebesgue 
dominated convergence theorem. 


(iti) Here is yet more evidence why the Lebesgue dominated 
convergence theorem does not apply, because if it would, then by the 
Lebesgue dominated convergence theorem part (7i7), 


lim y.-566 dle ~~ Fn\dA = Lim p00 f\0 _ fnldr 


should equal zero, which is not true because it equals 1. Thus, 
although { f,,} converges to 0 pointwise almost everywhere on R,, it 
does not converge to 0 in mean, that is in the L'-norm. Clearly, there 
is no other function to which { f,,} converges both pointwise a.e. and 


in mean. O 
2.10 Let 
nx, O<a< = 
- 2 2 1 2 
faz) =4 —n?(z—- 4), n Ses, 
0, x > Ac 
Show that 


flim fi dA < lim f fn dd. 


Explain why the Lebesgue dominated convergence theorem 
cannot be applied. 


Solution. It is easily seen that the graph of f,, represents a triangle on 
interval [0, 2] : Syne about 4, with a vertex at (4, n), and that 
fn iS zero on [0, 2] . Therefore, the function is continuous and 


integrable. Clearly, f,, - 0 pointwise A-a.e. More specifically, f,, 
converges pointwise to function 
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co, «rt£=0 
on eS 0: 


Thus f is zero A-a.e. on R,. Obviously, the Lebesgue integral of f is 
zero. On the other hand, evaluation of 


So fn(a)A(dar) = (R) fo” fn(a)dax 


aig + pointwise everywhere on R. This validates the 


gives §+54 
inieaaality, 


We remark that treating f as the Dirac delta requires an additional 
assumption imposed on f that 


[& f(x)g(x)dx = g(0), 


which in particular, would imply that Jo f fide = + and thus the 
equality of the above two integrals. aa the latter would 
contradict the Lebesgue integration and yield a different result. The 
Dirac delta is a so-called generalized function and its treatment falls 
into a different area of analysis known as “generalized functions.” 
The reader should be aware of the seeming discrepancy. 


As in Problem 2.9, we cannot use the Lebesgue dominated conver- 
gence theorem, because there is no dominant integrable function 
g = 0 for the sequence { f,,} required so in the Lebesgue dominated 
convergence theorem. 


2.12 Give an example of a monotone nonincreasing sequence of 
measures convergent to a set function js setwise such that ju is 
not a measure. 


Solution. Let \ be the Lebesgue measure on (R, 6). Define ju, = +). 
We show that im, fin. = Ue 1s: not me Take A, = [0, 1), 
Ay =[1,2),.... as a decomposition of R, = [0,co). Because 
fin(R4) = co for each n, it follows that «(R;) = oo. Now as 
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[n(A,) = 4, it follows that (A;,) = 0 for all k. Hence, pu is not o- 


1? 


additive. O 


2.14 Let f € L(Q, 2, u;R,B(R)) and suppose that {Q,,} is a 
countable measurable partition of (2. Show that 


J fdp = dina J fade. 


Solution. We start with 
Sfdu=fftdu—ffodp 
and then have 


Jftdp= f fsx o,du 


n=1 


by Beppo Levi 
So, also 


{f- d= dina Jf dt 
Thus, ; 


Jfdpu= vr, [fae oe Jt at 


Because at least one of the series is finite, we have 


ffdu = Yom J fd. O 


3.2 In Example 3.4, we showed that the Dirichlet function f on 
[0,1] is Lebesgue integrable, but not Riemann integrable. 
Because the rationals have the Lebesgue measure 0, the 
function f is equal to 0 (a constant) for A-almost all points on 
[0,1], and therefore, it is continuous almost everywhere on 
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(0, 1]. By Theorem 3.5, f must be Riemann integrable. This is 
just the opposite of the result of Example 3.4. What is wrong 
with this reasoning? 


Solution. It looks as if f is continuous almost everywhere, in fact, the 
restriction of f on [0, 1]\Q is continuous. However, f is clearly dis- 
continuous at every point if defined on [0, 1]. In other words, the set 
of discontinuities of f is not a A-null set. O 


3.3 Is the function f(x) 


= + on (0, 1] Borel-measurable and - 
integrable? Is f € L((0, 


1], B({0, 1], 4)? 


Solution. (1) Let 


n, ze [ rs 


Obviously, f;, is continuous and bounded on [0,1] and therefore, 
Borel. Because the sequence {f,,} pointwise converges to f, the 
latter is also Borel. 


(it) Clearly, the sequence {f;,} is monotone increasing, so that 
f = lim,_... fp in the topology of pointwise convergence. Therefore, 
we can apply the monotone convergence theorem to get 


(L) fo faa = limMp—oc (L) fy fn(a)A(dz) 
= liye (R)f, ina am 


= lim,6(1+ Inn) = co. 


(iii) Consequently, f ¢ L'({0, 1], B({0, 1], A)). However, because f 
is nonnegative and the integral of f is +00, it exists and hence 
F< ((0, 1], B((0, 1],)). O 


3.5 Let f : [0,1] — R be defined as 
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f(z) = (= 1)""'n, té€ A,=(=5,; le ee ee 
0, x = 0. 


Show that f is improperly Riemann integrable but not 
Lebesgue integrable. 


Solution. 


(R) fp f(@ dx = 2°, fn(— Dt dx 


n 


1 
= yak _ aa n+1 < %, 


as a Leibnitz series. On the other hand, (L) f fdX is convergent if 
and only if (L) is | f|dA is. Hence, 


L)fo|fldd = ey fnl(— Dt! |Adz) 
A, 


= a) fda — ise =e O 
An 


3.7 Investigate if the function f(x) = SF 1 uo}(@) (where 


0 <a<1) is improperly Riemann and Lebesgue integrable 
over R,. 


Solution. f is improperly integrable by the Dirichlet criterion applied 
to f = gh, where g(x) = = and h(x) = sinx. By Proposition 3.9, f 
is Lebesgue integrable if and only if R=(R)f,~ | f | < co. For 
a < 1, we have 


ooo p(ntl)r sina} 
R= Dat n x dx 


TT 


= yr m  sin(t-+n7) dt 


n=0J0 = (t+n7)@ 
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sint 
ane 0 ido: 4 (t+n7)4 


2 1 
2 re Lan= a (nt lye — OO, O<ad<l. 


For a > 1 we use the estimate 


R= ile. ae abs ase < Cya> kk oO 
3.9 Show that the functional 
b 
If ll = Sa lflaa 
defines a seminorm on L'([a, b], B({a, b]), A). How can || - || 1 


be “made” a norm? 


Solution. Obviously, all properties of the norm apply except for 
|fllj: =O if and only if f =0. However, by Lemma 1.15, 
| f\|;: = 0 if and only if f € [0],. Therefore, we can replace || - || 1 
with the restrictor || - ||, (bold face) acting on the quotient space 
L'([a, 6], B([a, 6]), A)|, modulo and thus the function 


(L' ([a, b], B([a, b)), A)|y> R, ’ | 7 llza ) 


indeed defines a norm. O 


3.11 Show that the space C([a,b]) of all continuous functions on 
interval [a, b] is dense in (L'((a, b], B({a, b]), A), | - |I,)- 


Solution. Let 
f € L'((a,b], B({a,b)), ). 


We show that for any € > 0, there is a continuous function H such 
that || f-HA || pn <e. For F>0 such that 
F € L}({a,b], B({a,b]), A), there is a monotone increasing sequence 
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{s,} C F, such that sups,, = F’. Thus, for a given ¢, there is an N 
such that for alln > N, 


||_F — Snll za < Ss 
By Problem 3.10, for sj, there is a continuous function h such that 


lsw — Ally: <e. 


Now, 
|F — All < LF — swllz + [lw — Alls < 2. 


Thus, for a given e, for | f | and f~, there are continuous functions, 
h and g, respectively, such that 


If] — Alla <e and |]2f> —glln <e. 


Therefore, for H = g—h, 
lf — lz. = WIfl-—2f° +9—- All 
SWF] — Alles + Wl2f- ll < 2¢. O 


3.13 Let A be a subset of [a,b] whose closure is negligible w.r.t. 
Borel - Lebesgue measure X. Is the function g = 14 Riemann 
integrable on [a,b]? If it is Riemann integrable, then calculate 


(R) f° g(a)dx. 


Solution. (i) Consider the function f = 1q. Obviously, f = 0 on 
B = A‘ which is an open set. Consequently, f is continuous on B. 
(Note that this is not the case with the Dirichlet function 1g which is 
also constant on a subset that is not open; this is essential.) Indeed, 
because f is a constant function on B, say f = c, andif x € Bisa 
point and O, is any open neighborhood of f(x) = c, obviously, 


f*(Oc) = f*({e}) = B. 
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Now, because B is an open set and x € B, it shows that x is an 
interior point of B = f*(O,). Thus, f is continuous at any point 
ye 5b. 


(ii) Because A is closed, it is a Borel subset of [a, b| and therefore, 
f is a Borel function on [a,b]. According to (i), f is continuous on 
[a,b] A-a.e. Thus, by Lebesgue Theorem 3.5, Chapter 6, f is 
Riemann integrable. By Theorem 3.2, f is also Lebesgue integrable 
and (L) { f =(R)/ f. 


(iii) Because f € [0],, 
f.fdd =0 = J’ f(x)de. 


(iv) Now, we return to function g = 14. Obviously, g = f A-a.e. 
because A C A and 0 < g < f. By Problem 5.10, Chapter 5, then g 
must also be A-measurable and thus Lebesgue integrable. From (7i7), 
in particular, it follows that i gd = 0. 


(v) Finally, is g Riemann integrable? Because g = f on B and f is 
continuous on B, it follows that g is also continuous on B and thus 
on [a,b] A-a.e. Thus, g must be Riemann integrable and from (7i7) 


and Theorem 3.2, thus J? 9(a)da =). O 


4.1 Show that (foF)t = ftoF and(foF) =f oF. 


Solution. (i) Suppose [Qo, ©, F'] is surjective. Let 


P={f > 0} = f*((0, o]) 


and 

N={f <0} 
Then, 

¢ (P) = fx(P) © (0, 00] 
and 
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Now, let Pp = F*(P) and No = F*(N). Obviously, the latter two 
sets form a partition of {29 and in as much as F’ was assumed to be 
surjective, 


F.(Pp) = P and F,(No) = N. 
Thus, 

fPoF. (Po) = froFs(Po) = fe(P) © (0, oo], 
and 

fo oF. (No) = {0}. 


On the other hand, (foF’)* (wo) = 0 if and only if (foF)(w9) < 0. 
The latter, in turn, is equivalent to (f*oF’)(wo) = 0 and this holds if 
and only if wp € No. The rest is obvious and thus (foF')* = ftoF. 


The case (f o F')” = f~ o F is proven similarly. 


If F' is not surjective, F,(Py) C P and F,(No) C N (possibly each 
being a proper subset). Either way, the equation (foF’)* = ftoF 
still holds and so does (f o F')” = f~ o F. For instance, F’ can be 
made surjective when taking its codomain for a range and consider- 
ing Resf on F's codomain instead of /. O 


4.3 Let f €C1(R", B"), aE R, b € R", and B € B”. Show that 


[flow + b)A(dx) = dy f f(x) (dx), 
B A 


where A=aB+b. 
Solution. Let L(x) = ax + b. Then, 


I= ff(ax +b)A(de) = [fo L(«)A(dz), 
B B 


Let A = L,(B). Representing the Borel set B as 
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heat oeL(bj)= (A) = 4(A — bd), 
we have 


l= | 7 oL@)r\ds) 


L*(A) 
[by (4.1a)] 


= [fla)aAl* (dz): 


By Proposition 4.3, Chapter 5, \L* = A, implying that 


— fare 


= far { f(2)\(dz), (P4.3) 


where A = aB + b. Equation (P4.3) is due to Problem 1.18, that is, 
due to [ fd(cu) =cf fd, where c > 0. oO 


5.2 Let yw and v be measures on (12, 5’) such that v < w and 
suppose jt iS o- le Show that there exists a Radon - 
Nikodym density f ev , such that O< f <1. 


Solution. Becausev < pp = v < ys. Thus by the Radon - Nikodym 
theorem, there is a Radon - Nikodym density f € a > Qe Let 


A: ={f > 1} = f*((0,00)). 
We show that A is the empty set. Let 

An, am a! — aio): 
Then, {A,,} 7 and 


Ae 


n=1 


Now, 


fl4, > 14,(1+ 4). 


CHAPTER 7. SOLUTIONS TO SELECTED PROBLEMS 675 


Furthermore, 
V(An) = f fla,du > (1+ 4) u(An) 


and thus, v(A,) > w(A,) for all nm =1,2,..., which of course 
contradicts the assumption that v < py. Therefore, each A,, must be 
the empty set. It obviously cannot be a p-null set, nor can it be a v- 
null set. Consequently, A = ©. O 


5.4. Let 7, be a Poisson measure on (R, 6). Investigate whether 77, 
is absolutely continuous or singular relative to X. 


Solution. Let B= {0,1,...}. Then B is a Borel )-null set. 
However, 7.(B)=1. Hence, 7, is not absolutely continuous 
relative to X. Also, A(B) = m,(B°) =0 => AL TM. O 


5.6 Let p; < wand po L ps. Show that p; L po. 


Solution. From pz | ju it follows that there is an A such that 
p2(A) = p(A°) = 0. 


On the other hand, p; < yw implies that ;(A°) = 0. Therefore, 
pi + p2. O 


5.8 Let y and v be o-finite measures on ((2, »’). Show that ys and v 
possess densities f and g, respectively, relative to p = w+ v. 


Solution. Clearly, p(A) = 0 = p(A) = 0 and v(A) = 0. Therefore, 
ju <p and v < p. Now, because either ju and v is o-finite, so is p. 
Thus, by Radon - Nikodym theorem 5.7, there are nonnegative Radon 
- Nikodym densities, say f and g from the unique equivalent classes 


du dv : 
ie and ae respectively. O 
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5.10. Let f € 
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(Q, 3’, w;IR,, B(R+)), us be a positive finite mea- 


sure, Xiy C 2 be a sub-o-algebra, and wo = Resy, lu. Suppose 
Xp = {0,0, A,B = A°} where A is a measurable set. Define 


g= 


(ay { favo zai aBy) FAHoL 


Show that g is a Radon-Nikodym density of v = f[ fd with 


respect to jo. 


Solution. Clearly, g is y-measurable and pu-integrable. Now, 


gla = 


a 
Therefore, 
[9400 = 
and similarly 
J gd uo = 
We also have 


Jgdpo = 
Q 


Jagd = 


roa J Fdtols and glp = TR joey f Fauole. 


macy fdto H(A) = J Fduo 


A) J fd po po( A) 
A 


Therefore, [ gdjio = f fdjso on Xo. 


5.12 Prove that yw | pif and only if uA p= ?. 


Solution. (i) Let yw L p. Then, there is a set A such that 
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(A) = p(A°) = 0. 
Thus, 
O< (HA p)(Q) = (WA p)(A) + (HA p)(A*) 
< (A) + p(A’) = 0. 
Therefore, 4 \ p = 0. 
(it) Let uA p= 6 hold. If «A p= 8, then given an A € &, for 


each € > 0, 4 B such that jo(A) + p(A\B) < e. By Lemma 1.11 and 
Theorem 1.13, Chapter 5, 


0 = (uA p)(A) 
= inf{uu(B) + p(A\B): BC A} 


implies that for each ¢ > 0, B C A such that (A) + p(A\B) <e. 
In particular, for A = Q, 


0 = (“A p)(Q) 
=> Ve > 04 E such that 
w(E) + plE*) <e; 
that is, for each n, there is an E,,, such that 


UW En) + p(En) <2™. 


Let A, = U Ej, and A = {) A,. Then, 


i=n n=1 


wa) < wy) =0( U2) 


=n 


= pyar 627) < pe = a”, 
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Thus, ifm — oo, (A) = 0. On the other hand, 


p(As) = o( zt) < Bt) < § 


=n 


Vi > n independent of n. Hence, 


p(A;,) = 0 
implying that 
o(U 4,) < Ee p(4s) = (4) =o. a 


6.2 Let v be a signed measure on a measurable space (Q, X’) and 
let A be a v-finite measurable set. Show that if A = A; + Ao 
is any decomposition of A, then either A; and A> is v-finite. 
Furthermore, show that if v(A) = oo, then at least one of the 
subsets A; or Ay must be positively infinite. If A = US | A, is 
a countable measurable decomposition of a finite set A, then 
all sets A,,'s are finite. In particular, it follows that any subset 
of a finite set must be finite. 


Solution. Indeed, by additivity of v, v(A) =v(A;) + v(A2), and 
thus it is impossible that at least one of the terms on the right is 
infinite. Besides, by our earlier convention, we cannot operate with 
+00 — oo, which would make the additivity property of v inconsis- 
tent. Now, if v(A) = oo, then again due to the consistency of v, at 
least one of the subsets A; or Ay must be positively infinite. 


If A is finite and at least one set , say A;, is positively infinite, then 
g-additivity does not hold. 


Any subset S' of a finite set A is finite because A = S + (A\S) isa 
decomposition of A which we argued above. O 
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6.6 Prove Proposition 6.6. Jf A and B are measurable sets such 
that B is v-finite and AC B, then A is also v-finite and 
v(B\ A) = v(B) — (A). 


Solution. From B = (B\A) + A and additivity of v we have v(B) 
= v(B\A) + (A). By Problem 6.2, either term on the right must 
be finite. The latter allows us to subtract v(A) from either side of the 
equation to yield the statement. O 


6.8 Prove Proposition 6.8. Let (Q,2’,v) be a signed measure 
space and {A,,} be a monotone nonincreasing sequence of 
measurable sets such that for some n€ N, v(A,) is finite. 
Then the signed measure v is continuous from above on { A, }. 


Solution. (a) Assume that v is a signed measure on »’ and that 


{A,}| with |v(A1)| < co. Denote A: =) A,. Then, by Problems 
n=1 

6.2 and 6.6, |v(A,)| < oo for all n >1 and thus |v(A)| < oo. 

Because {A,,} is monotone nonincreasing, {A;\A,}f . It can easily 

be shown that 


U (Ai\4n) = 4n\( fn). (1) 


n=1 n=1 


Now we apply Problem 6.7 to the sequence {A;\A,,} and equation 
(1) to arrive at 


jim, U(A1\An) = u( U (Ai\An)) = y(A;\A). (2) 


n=1 


By our assumption v is finite on {A,,}. Therefore, by Proposition 
6.6, equation (2) reduces to 


v(Ay) — Jim. v(An) = v(A1) — V(A) 


and thereby yields assertion (7). 
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(b) Now suppose 1(A,,) is finite for some n. Then, by Problems 6.2 
and 6.6, Aj, are also finite for all k > and thus we can apply the 
arguments of part (a) to the tail of the sequence {A;: k > n} and 


have (considering that A = () Ax): 
k=n 


vy(A) = lim v(A,). C 
k — co 
6.10 Let $8 and St be the families of positive and negative sets, 
respectively. Show that $8 and St contain the empty set and 
they are closed with respect to differences and at most 


countable unions and intersections. 


Solution. We render the proof for family $B. It is obvious that the 
empty set is a trivial example of a positive set. 


(i) Let P,Q € $8. Then, from the representation 
(P\Q)NA=PN(A\Q) 

[Problem 1.2e), Chapter 1] it follows that P\Q € $B. 

(iz) Let {P,,} C $B be a disjoint sequence. Then, by o-additivity of v, 
Y((dona1Pn) NA) = dona YP 9 A) 

is the sum of measures of sets v(P,, A) > 0, so that the sum 
V((Yon=1Pa) 1A) € [0, oo]. 

(iii) If {P,,} is an arbitrary sequence of positive sets, then we can set 
Pi = Pi and PP =Pi\(o Pi) yn S 2, 


so that {P*} C 8 [by parts (7) and (i2)] and by part (72) 
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UR= DR e®. 


(iv) Let {P,,} C and denote Py = (} P;. Then, 
i=l 


BCR = v(Py) > 0. 


Furthermore, for any A € 3’, Py N A C P, implying that v(P) 1M A) 
> 0. Therefore, Pp € $B. 


The arguments about the family of negative sets are identical. O 


6.11 Let $8 and Nt be the families of positive and negative sets, 
respectively. Show that v is monotone on either family. 


Solution. Let P,Q € $8 such that P C Q. Then, from the decompo- 
sition Q = P+(Q\P) and the fact that Q\P € 8B (by Problem 
6.10) it follows that v(P) < v(Q). 


Now, let M,N € St such that M C N. Then, N = M+(N\M) is 
a decomposition of NV into two negative subsets. Therefore, v(.V) < 
v(M). So, here we have the monotone decreasing property: with 
MCN=>vU(M)> VIN). 


6.13 Show that all Hahn decompositions form a unique equivalence 
class. 


Solution. Suppose (P’, N’) is another Hahn decomposition. Because 
P\P' CP, by Remark 6.5(iiz), it is a positive set. But because 
P\P’ is also representable in the form PN (P’)© = PON’, it is a 
subset of N’ and by the same Remark 6.5(7i7), must be negative. 
Finally, by Remark 6.5(izi), (b), P\P’ is a null set. Consequently, 
P’'\P isanull set and thus PAP’ is a null set. The rest is similar. 


6.15 Show that any signed Borel - Lebesgue - Stieltjes measure on 
(R”, B) is o-finite. 


682 CHAPTER 7. SOLUTIONS TO SELECTED PROBLEMS 


Solution. Recall that (IR", B(R"),v) is a Borel - Lebesgue - Stieltjes 
measure space if v is a Borel measure and it is finite on any d,- 
bounded Borel sets. Obviously, 


V(TL(- aa) <o, 


where a; = k, and that 


{TL (- ana) bt Re Oo 


i=1 

6.17 Let v € G,(O, X’); that is, v is a finite signed measure. Is it 
true that v is bounded? 

Solution. 


V(A)] < |2(A) 


=vt(A)+v (A) < v7 (Q) + -(Q) 
=»(P)-»(N) =|¥| =M, 


given (P,N) € [H*,H],. Because v is finite, v(P) < oo and 
—v(N) < oo. Thus, M is a nonnegative real number and we have 
that for each A € S’, |v(A)| < M. In particular, it follows that 
\|v"|| < 00. O 


6.19 Prove Lemma 6.25. Let 1,1) € G,(Q, 5’). Then the set 
function 


p(A) = sup{v1(E) +W(A\F): BEN A} 
is a finite signed measure on 3). 
Solution. 


(i) Obviously, p is a set function defined on »’ that equals zero at 
the empty set. Next we show that p is finite. By definition, 
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p(Q) = sup{m(F) + 14(E°): Be x}. 
Then we have 
ji (E) + (E*)| 
< |1(E)| + lv2(E°)| < |vil(@) + |v2l(Q) 


= ||"1|| + ||v2|| < 00. 


Therefore, 
|e(Q)| < [lvl] + [val] < 00. 


(it) Now we need to prove that p is o-additive. Let {A,} C ©’ bea 
sequence of disjoint sets with A: = isa Let ¢ > 0 be chosen. 
From the definition of p, given A,, and an €, > 0, there is a subset 
E,, © An such that 


p(An) st (En) at V2(An\En) + En, N= 1, 2, wens 


We choose ¢,,'s such that }>*°,¢, <¢, which we specify before- 


hand. Denote E = 5>~_,E,,. Obviously, E's are disjoint, E C A, 
and 


A\E = Yinai(An\ p= Ex) 
= Vinai(An\En); 
because the rest of the E;'s are disjoint with A,,. Therefore, with 
1 (E) + 2(A\E) 
= Vine (En) + Viner ¥2(An\En) 
we have: 


dona=1P(An) < ye 1(En) 
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+ opn1¥a(An\En) + Dipn1En 
< (EB) + V2(A\ F) re 
< p(A) +e. 


Because this inequality holds for each ¢ > 0, we arrive at the desired 
inequality 


wena1P(An) < p(A). 


In particular, we notice that }>>°, (An) < co that holds true for any 
disjoint sequence of sets {A}. 


(iti) Now, we establish the inverse inequality. We thus need to show 
that p is c-subadditive on {A,}, i.e. 


p(A) < 2% 1p(An). (P6.19) 


Let £ be a measurable subset of A. Then, 
(EB) + 12(A\E) 
= 11 (Yona (An 1 B)) + oY (An 9 E*)) 
= ne (An NB) + ¥2(An\(An NE) 
< VnP(An) 


which yields (P6.19). The series }***_,p(An) < 00 as noticed from 
part (27). On the other hand, it also follows that 


—00 < Di p(An)- 


So, we are done with the lemma. O 


6.21 Let v,p € G,(Q, ©). Show that v + p € G,(Q, 2’). Is it true 
that 


v+p=(v+p)°—(Vt+p) 
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=vt+pt—vy—-p? 


Solution. Obviously, v + p is a finite signed measure through its re- 
presentation vy + p=v*t+pt —v~ —p-. Given a measurable set 
A, we define (v + p)(A) = v(A) + p(A). However, 


vi + pt—v-—p 


need not be a Hahn decomposition of v + p. This is because the 
Jordan variations v*,v~ and p*t,p relate to possibly “entirely” 
different Hahn classes [H",H™], and [H*, H™],. Or equivalently, 
v* + p* need not be orthogonal to v- + p~, even though v* | v~ 
and p* | p~ hold. O 


ead 


6.23 Show that G,(, »’) is a vector space over the field 


Solution. The statement is in part due to Problems 6.21 and 6.23. In 
addition, the zero measure @ must be included. im 


7.1 Show that the total variation of a complex measure € has the 
property, |€| < vj +v; +vz +v~ and that in general, |é| < 
vi +v7, +vz +v~. Thus, it disagrees with the associated 
property for a signed measure. 


Solution.(i) First, from (7.3a) we have that 


|E(A) < |rn|(A) + |r2|(A) = (vf + op + vg + v2) (A). 


(ii) The reason why |é| < vj +7 +v3 +v° is purely algebraic. 
For a set EF, 
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x |v (B) — vy (E)| + (Vy (B) — (BE)? 


Consider the complex measure space ([0, 4], B([0, 4]), €), where 
E =X- 2€o + i(e, = EQ). 


It is easy to verify that € is a complex measure as it has two signed 
measure components. Consider the partition 


E, = {0}, Ey = (0,2), E3 = [2,4] 


of [0,4] and calculate S~*_, |€(E;)| = V/2 + 2/5. It can be readily 
shown that 


61(0,4]) = Dele, 
whereas the sum of all Jordan variations of set [0, 4] is 
(A + 2€9 + €1 + €2)((0, 4]) =... O 


7.3 Prove Proposition 7.3. Any complex measure & is continuous 
from below. 


Solution. Let € be a complex measure on ' and {A,,} a monotone 


(oe) 
nondecreasing sequence in »’ such that A: = J) A,,. Denote 


n=l 
B, = Ay \ Anis n=1,2,..., Ao =. 


Then 
Ay = a B k 


and { B,,} forms a pairwise disjoint sequence from >’ with 


dina Bn = A. 
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Because |€(A,,)| < oo, (Bn) is well defined and, therefore, due to 
o-additivity of € and because |£(A)| < 00, 


fim €(An) = Jim, 2n=1 &( Br) 
= re §(Br) 
=€ ODam Br) = &(A), 
which yields continuity from below. O 
7.5 Prove Proposition 7.5. Let € be a finitely additive set function 


on the measurable space (Q, 5’) such that €(@)=0. € is 
continuous from below if and only if € is o-additive. 


Solution. (i) Suppose € is continuous from below and let {B,,} be 
a sequence of pairwise disjoint measurable sets from >’. Let 


An — yee 


Then, {A,}7 CX’ and A= U A,, equals )>*~ , B,,. Due to conti- 


nuity from below of € sal. rie its finite additivity we have o- 
additivity: 


E(A) = En = litt, -ses6( Ay.) 
(ii) Let {B,,} be a sequence of measurable pairwise disjoint sets. 
We form the sequence {A,,}7 as in part (i). Then {A\A,,} | @ and 


continuity from above yields @-continuity and the latter in turn yields 


Jim, €(A\An) = €(@) = 0. 
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Because the € is finite on 4’, by Problem 7.2, the last expression 
leads to 


jim, €(An) = €(A). 


Applying equations (1) we arrive at o-additivity of €. 


(iti) Suppose € is @-continuous from above. Let {A,} be a 
monotone nonincreasing sequence of measurable sets and let 


A=()A,.Let B,: = A,\A. Then, obviously, () B, = @. Now 
n=1 n=1 
we have 


(An) = €(Bn + A) = (Bn) + €(A). 
By the above assumption, 
limn—oo€(An) = limn0o€(Bn) + €(A) = 04 €(A). 


Thus, € is continuous from above. By part (ii), € is therefore o- 
additive. O 


7.7 Any complex measure € € @((, 5’) is finite. It means that for 
any measurable set A, |€(A)| < oo. (a) Does it mean that € is 
bounded? (b) Let €=1v, +i, where 14,1 € G,(Q, 2’). 
Show that for any A € 37, |€(A)| < ||| + ||v2]]. 

Solution. Given A € 3) and € = 1, + iv) we have 
JE(A) FP = [rn (A)? + [r2( A)? < (1 (A)| + |r2(A)I)° 

and thus 
|E(A)| < [1 (A) + |v2(A)| (P7.7) 

S [va |(22) + Jv2|[(Q) = |lval] + [lvl 


The latter is a real number, according to Problem 6.17. O 
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7.9 Let €,6,& € @(Q, 2’) such that for each A € SY’, |€(A) 
|1(A)| + |§2(A)]. Prove that ||§|| < [l€1|| + llé2Il- 


/\ 


Solution. Let S(A) C »’ be the family of all finite subpartitions of A 
and let {F,,...,£,} C S(A). Then, 


ye) = a ei) 
+ iii |€2(E:)| 
< |€1|(A) + |€2|(A). 


Therefore, 


|EI(A) S 1&:](A) + [&2/(A) 
implying the desired inequality ||€|] < ||&1|| + |||. O 


7.11 Let € =v +p be a complex measure on a measurable space 
(Q, 5’). Prove the following inequalities for total variations 
and norms. 


IE] > Sy (l4l + lel) and [Ell = Je (lll + llell). 


Solution. We first notice that if € = y+ ip is a complex measure, 
then for each measurable set A, 


V/2|€(A)| = |v(A)| + [p(A)I.- (P7.11) 


Furthermore, let S(A) be a set of all finite measurable subpartitions 
of set A and let {F\,...,£,} C S(A). Then, by (P7.11), 


ini lY(E:)| + 5-1 lo( Es) 
< VAC NEE) < V2I(A) 
Therefore, we conclude that for any measurable set A, 


Iv\(A) + [ol(A) < V21EI(A 
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The similar normed inequality follows immediately when setting 
A=. O 


7.13 Prove Theorem 7.8 that (C(Q, ’), || - ||) is a Banach space. 


Solution. Let {&, = Un + tpn} C C(Q, =’) be a Cauchy sequence. 
Then, by Problem 7.12, {v,,} and {p,} are Cauchy sequences of 
signed measures. By Theorem 6.31, we have the following. 


(i) {v,} and {p,} converge to signed measures v and p uniformly 
setwise, 


(it) G,(Q, &’) is complete and {v,,} and {p,,} converge to v and p 
in the norm. 


Therefore, we define € = +7. Obviously, € is a complex mea- 
sure. Now, &, — € = Up, —UV +i(pn — p) and thus by Problem 7.8, 
for each ce > 0, 


ll&n — Ell < lun — vl + [lpn — ell <e, n= N, 
implying that C(Q, >’) is complete in || - ||. O 


7.15 Show that, given a signed measure v, it holds true that 
(0,413,138) = (0,27, |v). 


Solution. By (7.9a) f € L1(Q, ©, v;iR) if and only if 


fe VOY 2) nL O27 =: R). 


On the other hand, because |v| = v* + v~, by Example 1.9 (iv), 
S\fldlv] = Jlfldwt +v~) 
= /|fldvt + f\fldv-. 


Therefore, f or | f| € L1(Q, X’, |v|;R) if and only if 


Fe LO 2 hit (O.5.y eR). O 
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7.17 Let (Q, 27,1) be a measure space, with j. being a positive 
measure. Prove that for each fe L1(0,2,u;C), |€| = 


J\fldu, where € = f fdy. 
Solution. (1) First we show that |g] < [|f|dy. Let Ae »’ and 


{F1,...,E} € S(A), where the latter is the set of all measurable 
subpartitions of A. Then, by Problem 7.14, 


(EB) = | Pde 


i 


< f\fldu. 
A 


(ii) We show the inverse inequality. Define 


and 9 = FLypisoy- 


Then, because G, as a complex-valued function, consists of four 
positive components (st,s ,¢*,t~), there are four monotone 
increasing sequences of simple functions that converge pointwise to 
(s*,s_,t*,t—). In other words, there is a sequence {g,,} of “comp- 
lex-valued simple functions” that converge pointwise to G. We can 
say that a sequence in polar form 


Mn (rn) i0” 
{ Gn = re os | 
J 


converges to G(w) = r(w)e”™) pointwise. [Notice that r(w) = 
14) ¢()|0}-] Therefore, the sequence of functions {g,,f} converges 
pointwise to the function Gf =|f|1,)30}. Therefore, by the 
Lebesgue dominated convergence theorem (for complex-valued 
functions, as of Problem 7.16), the integral [g,,fdju converges to 
JG fd and the latter obviously equals 


Jofdu= f\f\ldu. 


Notice that because |g| = 1 on {|f| > O}, the sequence {g,,} must be 
bounded and so is {g,, f} pi-a.s. on 2. 


Now, if g,, is in the form 
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Mn , (n) — Mn re a i” 
Gn = aa 17; 1 4 = ae {7 1m, 


then we have 


m ig” 
Jonfdn) = D> a : be 9; Vy fdu 
yo e(A nA” ’) | 
nS lel(A nA ) 
= ral éI(A), 7.17) 
where 
= max { r\" :7=1, ‘sialon 
Because rw) — 1 for all j's [because r = 1 on the set {|f| > 0}; 


because we deal with an inequality, we can assume r= 1 
everywhere on (2], so does r,, — 1. So, the right-hand side of (P7.17) 
converges to |€|(A), whereas the left-hand side of (P7.17), by the 
Lebesgue dominated convergence theorem (as noticed) converges to 
J\fldu. This shows the validity of the inverse inequality and 
A 


herewith the equation 


|E|(A = JIfldu, forall Ac X. oO 


7.19 Let (Q,27,v) . a signed measure space and f € 
(Q, 53, R,). Define p = [fdvt — [fdu-, p, = f{fdvt, 
and pz = ii fdv~. Prove that p; and p2 are the Jordan varia- 
tions of p. 


Solution. The set function p is a signed measure as long as at least 
one of the integrals p; or p2 exists, which we assume to be the case. 
Now we need to check if p; 1 po. 


(a) Let g = 14, where A € » is fixed. Define 
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PL = fgdv* = Jladv* 


and 
px= {oa = f[ladv-. 


vy) 


Then, for some Hahn pair (P, N) € [H*, H7] 
pi(N) =v*(NN A) =0 
and 
po(P) =v (PNA) =0, 
because v* and v~ are both positive measures and thus are 


monotone. Therefore, p; | p2 through (P, N). 


(b) Let g = >>” ,a;14,. Then, 
pil-) = Seyavt(- 1 A)), 
po(-) = 2 av (- 9 A;). 


vy?) 


Again take for some Hahn pair (P, N) € [H*, H-| 
inal (NN Aj) 
soe (Poi A;) = 0. 


Therefore, in this case p; | po. 
(c) Let {g,} C W, be a monotone nondecreasing sequence con- 


vergent pointwise to an f € C;' previously selected. Then, with 
p, = J fdv* and po = [ fdv~ we have 

pi(N) = sup f lwgndv* 
p2(P) = supf1pg,dv~ =0 


due to case (b) for each g,,. Thus, ; L p2. Therefore, p = pi — p2 is 
the Jordan decomposition relative to the class [H*, H~],,, as per Pro- 
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position 6.19, and p; and p2 are indeed the Jordan variations of the 
integral measure p. O 


7.21 Let (Q,2',v) be a signed measure space and f € 
L'(Q, ©;v,R). Prove that f = 0 v-ae. if and only if [ fdv 
A 


= Oforall Ac »’. 


Solution. Let p= [ fdv = [ fdvt — [fdv~ = pi — po, where p; 
and 2 are signed measures, because f is an extended real-valued 
function. According to Problem 7.16 which also holds for signed 
measures, |p;| = [|f|dv* and |p.| = [| f|dv~. Furthermore, 


lai| = f\fldv* and |p2| = f|fldv- 


are positive integral measures with respect to the Jordan variations 
v* and v~. Because | f| € C7", by Problem 7.19, |p;| and |p2| are the 
Jordan variations of the integral signed measure 


lai] — |p] = fl flv — fl fldv- 


and thus, the latter is 7: = [|f|dv. Furthermore, as the Jordan 
variations of the signed measure 7, |p1| L |p|. 


Now, if p = @, as by the assumption that p(A) = 0 for all A € XY, 
we conclude that p; = po. But then also |1| = |p2| and because they 
are orthogonal, as we argued in Problem 7.20, |1| = |p2| = 0. This 
in turn yields that |f| = 0 vt-a.e. and |f| = 0 v~-a.e. Consequently, 
|f| =0v-a.e. and so is f = 0 v-ae. 


The converse of this statement is trivial. O 


7.23 Prove the monotone convergence theorem for a monotone 
nondecreasing sequence of nonnegative measurable functions 
{fn} on a signed measure space pointwise convergent to 
f €L(Q, X;v,R,), namely, 


lity os, jae = | fav. 
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Solution. Because sup{ f,dv+ exists and equals [ fdv+ by the mo- 
notone convergence theorem with respect to positive measures and 
so is sup f,dv~ = { fdv~ and if each is finite, we have 


lim, «| fpav— [fdv’ = [fdv~ = [fadv. 


In the event one of the limits (just one only) is infinity, sup [ f,dv 
still makes sense and it will equal plus or minus infinity. 0 


7.25 Let (Q, ’) be a measurable space and v and pbe two signed 
measures on (2, X’) such that v + p is defined. Suppose a and 
b are two real constants. (7) Is the linear combination av + bp 
always defined? (iz) If av + bp is defined, show that it is a 
signed measure on (9,57) and prove that for any f € 
ENO, a7; V, R) a LO, 27; P; R), 


{J fd(av + bp) = af fdv + bf fdp. 


Solution. (i) No, it is not always defined if v + p is defined. For 
instance, suppose for some A € 3’, v(A) = p(A) = oo. However, 
fora = 1 andb = — 1,v — is not defined at A as the difference. 


(iz) If av + bp is defined, the statement follows from Problems 7.22 
and 7.24. O 


7.27 Let (Q2, 4’) be a measurable space and € and 7 be two complex 
measures on (Q2, »’). Show that € + 7 is a complex measure on 
(2,2) and prove that for any f € L1(0,7;€,R)N 
D(Q, 33, R), 


Sfd(€+n) = ffdé+ f fdn. 


7.30 Let (Q,,v) be a signed measure space and f € 
L'(Q, ©;v,C). Prove that f = 0 v-ae. if and only if [ fdv 
A 


= Oforall Ac ». 


696 CHAPTER 7. SOLUTIONS TO SELECTED PROBLEMS 


Solution. Let f = u + iv. Then, by Definition 7.9 (iv) (in our special 
case of signed measure v), [ fdv = fudv +ifvdv. This complex 
integral equals zero if and only if fudv = [vdv = 0. Applying the 
statement of Problem 7.21 to each integrand we arrive atu =v = 0 
v-a.e. The converse of this statement is trivial. O 


7.31 Let (Q,2°,€) be a complex measure space and f € 


L'(Q, ©';v,R). Prove that f = 0 €-ae. if and only if [fdé 
A 


= Oforall Ac ». 


Solution. From Definition 7.9 (iv) we have | fd€ = [ fdim+ 
if fdvy. This complex integral equals zero if and only if f fd, = 
J fdv. =0. Applying the statement of Problem 7.21 to f we have 
f =014-a.e. and 1)-a.e. The converse of this statement is trivial. UO 


7.33 Show that (£1(Q, X;v,R), +, -, X ) is a vector lattice over 
R. 


Solution. (i) L'(Q, ©';v,R) is a vector space. This is due to the fact 
that for each f € L'(Q,2;v,R), [fdv is finite and it is a linear 
functional. 


(ii) If f,ge€ L'(Q,X;v,R), then so are f + g and |f —g|. The 
latter is due to Problem 7.32. Then, because by Problem 7.17 (7), 
Chapter 1, 


fV9=5(ft+9+|f -gl) 


and 
fAg=5(ft+g-|f-gl), 


we have f Vg, f Ag € L1(Q, X;v,R). oO 


7.35 Let € € C(O, »’) (be a complex measure). Show that a function 
f €L'(O, 5; €,C) if and only if |f| € £1(0, ©; €,R,). 


za 
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Solution. By Definition 7.9 (iv), 


f=utive L1(Q, 2; €,C) 


if and only if u,v € L'(Q, ©; €, IR). On the other hand, by Problem 
7.34, u,v € L'(Q, X; €,R) if and only if |ul, |v] € £1(0, 3 €,R,). 


wz 


By Definition 7.9(izi), as well as by Problem 7.35, 


EMO, LE, Ry} 


Al 


= 1'(9, 23m, RK.) L'(Q, D3, ). 


Furthermore, by Problem 7.15, [1(Q, Y,v;R) = L1(Q, &, |v; 
implying that 


Al 
od 


Al 


TO, Ig, 


) 
= IA (O), 23 JR) AN £1(Q, 2; |ve|,R+). 


On the other hand, £1(Q, ©; |~|,R,) and £1(Q, ©; |v2|, Ry) are 
semilinear spaces over the semifield R1, which yields that with 


+) 


Jl 


Jul, Jo] € LIQ, 35 |ri],Ry) 9 £1(Q, 2; |v], 


we also have 


Jul + |v] € £°(Q, 25 |r|, Ry) 9 LQ, 25 |v2], 


Al 
JU 


Finally, the statement is due to the following inequality: 
g(lul + lol) < Lf] < lel + lal. O 
8.1 Prove Proposition 8.2. Let v be a signed measure and 1 - a 


positive measure on a measurable space (Q, 5’). Then the 
following statements hold. 


@) veep So <pandy <p=>|y| <p, 
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(ii) vi <pandvo <p Sv <yp and |p| <p. 
(wi) vpl<poav<pandv <p >v<cup. 


Solution. (i) Suppose v < yp. Let (P, N) € [H*, H”],. By Defi- 
nition 6.15, 


vt (B) =v(POE) and v (£) = —v(PNN), Fe. 
Suppose F is a j-null set. Then, by monotonicity of positive meas- 
ures, U(P ME) = u(N ME) = 0. Thus, due to absolute continuity 
of v, 

0=0(.PN 2) =r (2) =v (2) = p24) =0. 
Because |v (E)| < |v(E)| we also have v(E) = 0. 
(it) Supposev* <p and v~ < yand p(£) = 0. Then, immedi- 
ately, |v|(E) =0 and due to |v(E)| < |v(E)| we also have 
vi) = 0, 
(iti) Suppose |v| < yu and p(E) = 0. Then, 

0 = |p|(B) =v (BE) + ¥-(B) 


yields 


also implying that v(E) = 0. O 
8.3. Let (9, 5’, €) be a complex measure space. Show that € < |§]. 


Solution. Recall that |€(A)| < |€|(A), for each A € &, and let € = 
V1 + ivy. Suppose |€|(A) = 0. Then, |€(A)| = 0 and thus we have 


\E(A) |? = (4(A))? + (2(A))? = 0 


implying that 
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(A) =i,(A) =0 = v(A) =0. CO 


8.4 Let € € @(, »’). Show that the Radon-Nikodym derivative 
a of € with respect to |g| — and that the modulus of any 
Radon - Nikodym density from + ria equals 1 |€|-a.e., where 1 is 
the function defined as 1(w) = 1,w €. 


Solution. Because by Problem 8.3, € < ||, the Radon -Nikodym 
derivative aH of € with respect to |€| exists, by the Radon - Nikodym 
theorem (Theorem 8.9) and it is a subset of L1(Q, 7; €,C). This is 
because |€| is finite as the total variation of complex measure €. 
Now, by Problem 7.17, 


lel = sag lal. 
Therefore, af is the Radon-Nikodym derivative of |é| with 
respect to |€|. Furthermore, because 


Ie = fldlél, 


it follows that 1 € a Consequently, all other Radon -Nikodym 


densities must be equal to 1 |£|-a.e. O 


8.6 ma € € C(O, »’). Show that the Radon-Nikodym derivative 


rial exists and that for each g € L(, X; €,C), 


Sodé = Soae all. 


Solution. Because by Problem 8.3, € < |&|, by Theorem 8.11, given 
any g € L1(Q, XY; €,C) for any f € FEE 


Jodé = fof dl€|. O 
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8.7 Let ys and p be two o-finite signed measures on a measurable 
space (2, 5’) such that 4. © p (meaning w <p and p < p). 
Show that 


4 
d dt 
ai = (¢) ji- (OF p-) ae. 


Solution. We apply the chain rule (Theorem 8.11) replacing € in 
€< p< p with p. Then, we have 


dp _ dp du 
dp du dp* 


On the other hand, p = [ dp implies that 1 € e and therefore, 


dp du 

1eE dj dp* 
In other words, for all Radon - Nikodym densities from de and ce 
a p 


there are some whose product equals 1. Consequently, the rest of the 
Radon - Nikodym densities forming the above two classes are such 
that their respective products equal 1 p- and therefore ju-a.e. O 


8.9 Let (Q,2',P) be a probability space under the following 
assumptions. Let Q = [0,1), © = BN[0,1) and P = ResyA 
(where A denotes the Borel-Lebesgue measure). Let H = 
{H,,...,H,} bea finite partition of Q such that 


Hy, = (1, *),b=1,.4,%, 
and let 39 be the smallest o-algebra generated by H. Let 
X(w) =w, for all we. Calculate a version Xo of the 
conditional expectation E[X|’o| and give all other versions of 
E[X| Xp]. 


Solution. Obviously, we deal here with a special case of Example 
8.15, whose results we will utilize throughout this solution. First, 


P(A) = 5, 


and 
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ee XdP = ee w A(dw) = 


Thus, from (8.15c), 


E[X|H] = pay, [XaP 


and we have 


E[X|Hy] = 45 n = 4 


From (8.15e), 


Xo = Vina E[X| An] 1 


and we have 
_ yon 2k-1 
Xo — 22 1° 2n 17,; 


as a version of the conditional expectation 


a ; 


E[X |X], where 


Xo = 0(H) = {Q, GO, A: A is any union of H;'s}. 


The same arguments as in Example 8. 


clusion that X above is the only version of E[X|5 


15(2), lead us to the con- 
70]: oO 


8.11 Explain exactly why X 4 E[X|Xo] if X is not Xo-measurable. 


Solution. If X is not Xiop-measurable, then even though the equation 


{XdPy = [Xod 0 for all A € Xp 
A A 


holds, the requirement of Corollary 1.21 (that X be »o-measurable) 


is not met. 


8.13. Prove Proposition 8.17. Let X,Y € 
dip © SX? be a sub-o-algebra, and P 
two real numbers a and b, 


O 


(Q, a, ah R, B(R)), let 
= Resy,P. Then, for any 
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E[aX + bY |X] = aE[X|Xo] + bE[Y |X] Po-a.s. 


Solution. Let (aX + bY), be a Radon-Nikodym density of the 
integral measure [(aX + bY)dPo w.r.t. Po and let Xo and Yo be 
Radon -Nikodym densities of [XdPo and {YdPo, respectively, 
w.r.t. Po. Then, we have 


[(aX + bY)dPy = [(aX + bY),dPo (1) 


as integral measures on X. Also 


{Xd Po = J XodPo and 1¥a Po = {Yod Po. (2) 


By linearity of the integral, the latter two equations (2) yield 


[(aX + bY)dPy = f (aX + bYo)dPy. (3) 


Comparing (1) and (3) we have 


J (ax + bY) d Po _ [(aXo oh bYo)d Po (4) 


good on Xp as measures. As integrals, because both integrands are 
Sig-measurable, (4) implies that (aX + bY), = aXo + bYo Po-as. 
which exactly is the statement if we turn to the corresponding 
notation. U 


8.15 Prove Proposition 8.19. Let X ,,X2,.... be a monotone 

nondecreasing sequence of nonnegative r.v.'s_ from 
(Q, ©',P;R,,B(R,)), let Xo C XY be a sub-o-algebra, and 
Py = Resy,P. Then 


limp 0E[X;| Xo] = EllimyoXn|X'o] Po-a.s. 


Solution. By Radon - Nikodym, for the given sequence {X,,}{ there 
is a sequence {X°} of Radon - Nikodym densities, which is Po-a.s. 
pairwise monotone, according to Proposition 8.17. There exist point- 
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wise limits X = lim,,...X, and x = irs eee: toe On the other 
hand, applying the monotone convergence theorem (MCT) we have 


limp f XndP = lim, oc f XndPy = [ XdPy = vo. 


Thus, there is a Radon-Nikodym density, say Xo, of the integral 
measure Vp relative to Po such that 


{Xd Po = J XodPo on D0: (1) 


We will show that Xp) = X : (mod Po). 


Because X° is a Radon-Nikodym density of the integral measure 
J X,dPo relative to Po, we have 


f XndPo = f XPdPo on X% 


and hence 


lim soo f XndPo = limy soo fX°dPo, VA € Zp. 
A A 


Then, by the monotone convergence theorem, 


i XdPy = ‘ X°dPy), VA € Xp. (2) 


Comparing (1) and (2), we have 


f XodPo = [X dP, VA € Xp, 
A A 


which yields by Corollary 1.20 that 


Xo= ve (mod Po). 
In a nutshell, we have 


{X?} —_ {ELX;,| Xo] } 


and 
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limp socE[Xn|] = X°- 


On the other hand, we had 


Elim, won| 200! = E[X| Xo] = Xo 


and proved that X) = X : (mod Po). O 


9.2 Prove that 


Ag, eA 
Sa (At " Ay) a (A; x Aa)a, ~ . 


0, a, ¢ Ay” 


Solution. By the definition, 
£a,(A) = {we € OQ : (a1, y) € A}. 


Thus, 
Eq, (Ai x Ag) =— {we E Qo 1a, € Aj, We — Ay} 


Ao, aye A, 
O, ag ¢ A, : 


O 


9.4 Show the validity of assertion a) in Lemma 9.9. Let &4, be aj- 
sections for some a; € Q;, i =1,2.Denote 3” = {A € Ne: 
£a,(A) = Aa, € 2}. Then X" is a o-algebra in Qy ® Qs. 


Solution. (i) Q=Q,xQ2€e 2X" because, by Problem 9.2, 
Gal) = Oe: 


(ii) Let A € &”. Then, €,,(A) € 279 and [€,,(A)]© € 2. By Prob- 
lem 9.3, [&,(A)]° = £2,(A‘). Hence, &,,(A°) € 2’) and consequent- 
ly, A° € X”. In particular, it follows that @ € X”. 


(iti) We show that 7’ is closed w.rt. countable unions. Let 
{A;} C 2”. Then, by Problem 9.3, 
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fa, ( U A) = U &a,(An) € 2X. O 
k=1 k=1 


9.6 Show that the Borel- Lebesgue measure \” on B" and the 
product measure & A; on Bg must be identical. 


Solution. In Example 9.7 (7) we showed that B” = Bs. Furthermore, 
A” and 8, A; coincide on the semi-ring S”. Recall that \" emerged 


from lementady content \°” having the property that 


mr n 


r™((a™, b"]) = [] (bi — as) = [Ao (ai, 8:)). 


i=l i=1 


The same result for ® ri A; is due to Theorem 9.11. Now, obviously 
the premeasure \" on the algebra A(S”) and the restriction of 8, Vi 


on the same algebra A(S") must coincide. By Theorem 9. 12 (or 
Corollary 2.14, Chapter 5), the product measure 8X on B" i 


unique and thus must be equal to \”. O 


9.8 Let (Q;, X7;),7=1,...,2 +k, be measurable spaces. Prove 


that 
nt+k n nt+k 
® Y= {| ® YY; |] @ @ JD; |. 
i=1 i=1 i=n+1 


Solution. Obviously, 


n nt+k 
@XuwHI@( @ dX; 
i=1 i=n+1 


is generated by the measurable rectangles A”) x B“), where 


nt+k 
AM € @ 3; and BY © @ DX. 
i=l i=nt+1 


Thus, the generator x i "3 of | e +}; is a subset of the generator 


i=l 
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n ntk nt+k 
i=1 i=n+1 i=n+1 


Consequently, 
nt+k n nt+k 
@5c(@s,)@(‘@ x). (9.8) 
i=1 i=1 i=n+1 
Let 
nt+k 
BE® +; 
i=n+1 


be a fixed measurable set and let 


nt+k 
safae 6 5: Ax Be’ 5} 


n+k 
Then, ©” is a o-algebra that contains X »/;. Indeed, 
(i) x ¥, € XS’, because a quasi-rectangle 


i=l 


n+k 


A, X---xA,xBe © J}. 
i=1 
In particular, 
I] Q; € nye 


i=1 
(iz) If A € &”, then A € ® Sand thus Ac € ® 5). Furthermore, 
i=1 i=1 


A°x R= a eer (A®) fs T reid eR (B) 


— TG ich (II as) \4 M Tintin ameey) 


— ria (tI 95) \Th... .n} (A) Aq Tintt,,..ntk}(B) 
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= (hh eee <1) eB) A&B). 


The latter is the difference of two sets, with either one belonging to 
n+k 
@ 3. Thus A° € &”. 
i=1 


— 


(iit) Let $1, Sp,... © 27”. Then, by Problem 5.6(ii), Chapter 1, 


oo ec) n+k 
(Us) xe=Utsx 8) <8 5, 
i=1 i= i= 


i=1 


in as much as each S; x B € &” by the definition of ©” and the 
above assumption. 


Because x +, € 5 and x 57; is a generator of Q »/;, we have the 
{= od i=1 
latter be a subset of ©”. On the other hand, 7” C & >); by the 
i=1 


definition of 5’. Thus, X’ = 5). 
i=1 


— 


n+k 
Now, ” was defined w.r.t. a fixed set BE @ 


w=n+ 
same applies to any measurable set B and hence for every 


»/;. Therefore, the 
1 


n+k nm 
Be ® SX;andeveryA€ @ Jj, 
i=l 


i=n+1 
n+k 
AxBe ® Jj. 
i=nt+l 
Consequently, 
n n+k n+k 
( x) x ( @ x) Ce Sy, (9.8a) 
i=1 i=nt+1 i=1 


Because 
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is a generator of 


n n+k nt+k 
(3 x) ® ( ® x) GS © 25 (9.8b) 
i=1 i=n+1 i=nt+1 


Now, (9.8) and (9.8b) yield that 


n nt+k nt+k 
Coon mcr : 
i=l i=n+1 i=n4+1 


9.10 Let (Q:, 5%, 4:), 7 = 1,2, be measure spaces with o-finite 
measures and let Ac », ® Xy. Show that the following 
statements are equivalent 


(i) 1 @ po(A) = 0, 
(i7) pe(Aa,) =0 pai-ae. on D1, 
(iit) pus(Ag,) = 0 pig-a.e. on Qo. 


Solution. (i) = (it). Let f = 14 and suppose 41 ® ju2(A) = 0. Then 
f € Cy(Qy x Qe, XY, ®@ YX) and by Tonelli's Theorem 9.17, 


0 = p11 @ plo(A) = f fdin @ po 
= Jf fa, (W2)dp2(w2) dyn (a1) 


(due to Problem 9.5) 
JJ 1g, (4) (2) die (w2)) dy (a1) 


= J t2(&a,(A)) pr (dar). 
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Because by Lemma 9.10, the function aj +> [2(Ea,(A)) € Cz", by 
Lemma 1.15, j12(€.,(A)) = 0 p-a.e. on D4. 


The converse (ii) = (i) is again due to Lemma 1.15. 

The proof of (7) <= (diz) is identical. 

Finally, (4) = (éét) & (di) yields (ii) © (di). Oo 
9.12 Show that a, (As) = (f*(Az))ar » A3 C Qs. 

Solution. We need to show that for each subset A3 C Qs, 


(£a,0f) (Az) = €a,(f*(A3))- (9.12) 


First notice that €,, does not act like the usual operator. The reader 
can easily interchange the order on the left of (9.12) to see that it 
would not make any sense. However, the notation is still more 
convenient and intuitive. From the definition 


Ea of = {(we, f (ai, wa)) : we € No} 
we have 
(£a,0f)" (Az) = {w2 € Qo: f(a1,w2) € Az}. (9.12a) 


On the other hand, the right-hand side of (9.12), according to the 
definition of the section operator applied to set f*(A3), is 


Ea, (f*(A3)) = {we € Qe : (a1, 2) € f*(As)}. (9.12b) 
Now, because maps preserve the inclusion operation, we have 
fat (G1, W2)} © frof*(As) C As. (9.126) 


The second inclusion is due to Problem 2.6, Chapter 1. Inclusion 
(9.12c) is equivalent to f(a1,w2) € As, which makes (9.12b) 
identical to (9.12a). O 
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9.14 Show that &,0(af + 8g) = a&.,0f + P&a, ° 9. 
Solution. We have 


£a,0°(af + 8g) = {(y, of(ar,y) + Bo(a,y)): ye Yh 


which obviously is a linear combination of two functions from Y to 
Z and thus the equation holds. O 


9.15 Let f €C7'(Q) x Ny, YX, ® Xy) and let a sequence of simple 
functions {s,} C W.(Q) x Q2,Cz') be such that f= 
sup{s,}. Show that f., = sup{(Sn)a,} (Hint: Apply Theorem 
6.4, Chapter 5, and Problem 9.14). 


Solution. By Theorem 6.4, Chapter 5, f,, is literally a function of one 


variable, which, by Proposition 9.15, is 27)-)’3-measurable and so 
are functions (s,,),,. Because f is a pointwise limit of {s,,}, so ob- 


viously is f,, a pointwise limit of {(s,),, }- oO 


9.17 Let G, and G2 be semi-rings on Q; and Qo, respectively. Is 
G, X Go also a semi-ring on Q) x Q2? 


Solution. The answer is “yes.” (i) Let A = A, x Ap € G, and B = 
B, x By € Gy. Then, by Problem 5.10 a), Chapter 1, 


AN B=(A,NB) x (A.M By), 
which shows that G, x Gs is closed w.r.t. finite intersections. 


(ii) Let A= A, x Ag €G, and B= B, x By € Go. Then, by 
equation (9.3a) of Lemma 9.3, 


A\B = [(Ai\B1) x Aa] + [(A19 Bi) x (A2\B2)] 
(because G, and G» are semi-rings) 


= (0S: x Aa) + [(ALN Bi) x BE Sil 
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= ES; x Ae) + Se [C419 Bi) x Si] 
= DEG, 
where G; € Gi X Go. O 


9.19 Let (Q,2',) be a o-finite measure space and let f € 
Cz'(Q, Z). Prove that 


Jfdp = fouf > x})Xdz). (P9.19) 
(Hint: Use Theorem 9.11.) 


Solution. We use the change of variables formula in the following 
form 


Joo fdu= fgduf* 
or more specifically 
J go f(w)u tC: g(x) f* (da). 
In our case, with g = J (identity function) and f(x) = 0,x < 0, 


J{f(w)u(dw) = fro tu f* (dz) 


= frog (10, 2] )wf* (der). 


If we define set A = {(z, y) € R2 : 0 < y < x}, we can identify its 
x-section [0,2] = A,. So, we then have 


(uf* @ A)(A) = f°, A( Az) wf* (dz) 
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using the second iterated integral of formula (9.11) 
= feng ME (Ay) Ady), 


where A, = [y, co). The latter is legitimate, because is also o- 
finite. 


We remark that if X = f is a random variable on a probability space 
(Q, X(Q),P) such that X >0 as. and P= yp, then the above 
formula (P9.19) reads 


EX = [XdP = [~, P{X > x}X\(da) 


and gives an alternative computational formula for the expected 
value of X. oO 


9.20 Under the condition of Problem 9.19, let g: Ry — R, bea 
continuous monotone nondecreasing function such that 
g(0) = 0 and which is continuously differentiable on (0, 00). 
Show that 


So(f)du= 7 Lae v)u({f > x})A(dz). 
0,00 


Solution. We use the change of variables formula in the following 
form 


Joo fdu= foduf* 
or more specifically 


{9° fw)n(a = [ole )f* (da). 


Now, we can have 
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g(a) = fag (t)dt = v,((0, x]) > 0 


(because g’ continuous and nonnegative) and so 


J{fw)u(dw) = Joo ((0, 2])uf* (de). 


If we define set A = {(x, y) € R4.: 0 < y < x}, we can identify its 
z-section as [0,2] = A,. Obviously, the measure v, is o-finite. So, 
we then have 


(uf* @ vg)(A) = fr yvg(Ac)uf* (dar) 
using the second iterated integral of formula (9.11) 
= fold (Ay)dvg(y) 
and then Theorem 8.10 
= frnoltf* (Ay) a (y)A(dy), 


where A, = [y, 00). O 
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: =” reads “set by definition” 


NI XA NNN DADA ADAAADAADTAAAAAADAA A DW 


1 (end of a proof, remarks, examples) 


— 
So 


liminf A, (lim A,,) 
nm CC 


— 
So 


liminf A,, (lim A,,) 
nC 
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[P(Ry), P(Dy), FF] 
[P(P(X)), P(P(Y)), fas] 
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[X, Y, I] (identity function) 
Res, f (restriction of f to set A) 


< (a partial order) 
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Tp (the projection of set X on its quotient by E) 
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[| Y;z (Cartesian product) 


rEX 


I] ¥2,¥a,m | (ath projection map) 
rEX 


I] ¥:, I] Ya, nal (A-projection map) 
LEX acA 


m,(A) (an A-cylinder with base A) 
Mage saa) {flee flew) = Mme (fed) 


card( A) 

No (aleph naught) 

¢ (cardinal number of continuum) 

2° (cardinal number of hypercontinuum) 

F (field) 

(X,R, +,-, x) (partially ordered vector space) 


at: _ 


x V @ (positive part) 


x: = (— 2) V 6 (negative part) 
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A. (Lebesgue content on 7?(R”)) 


A* (Lebesgue outer measure) 


Aj = Resc«A* (Lebesgue measure) 
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213 


213 
213 


216 


241 
247 
201 
255 
263 
264 
265 
265 
266 
274 
274 
274 
217 
283 
283 
283 
284 
290 
291 
ai3 
316 
316 
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L* (Lebesgue o-algebra) 


(IR”, £*, 45) (Lebesgue measure space) 

A = Resyis)A* = Resgirn) A* (Borel - Lebesgue measure) 

B = »'(r) (Borel o-algebra) 

(Q, B, w) (Borel measure space) 

Bi, (Lebesgue - Stieltjes o-algebra) 

[4g = Resg: 1" (Lebesgue - Stieltjes measure) 

ju; (Lebesgue - Stieltjes elementary content) 

ju¢ (Lebesgue - Stieltjes content, premeasure) 

ju; (Lebesgue - Stieltjes measure) 

jut (Borel - Lebesgue - Stieltjes measure) 

58(IR, B) (Borel - Lebesgue - Stieltjes measures) 

®, (extended distribution functions (d.f.'s)) 

®.|E: = {{f; uw}: € B} (quotient set of extended d.f.'s) 
8, (finite Borel - Lebesgue - Stieltjes measures) 

®D (set of all d.f.'s) 

uuf* (image measure) 

C~!(Q, 7; R, B(IR)) (real-valued measurable functions) 
C-1(Q, X; C, B(C)) (complex-valued measurable functions) 
R (extended real line) 

C4(Q, 2’; 1R, B(IR)) (extended valued measurable functions) 


(R®, Tp) (extended topology of pointwise convergence) 
C71(Q, ©; R)|,, (quotient set modulo 1) 


[0], (set of all measurable functions equal zero 1-a.e.) 
(0); = (0), 9c; 

Ge = eg, 2, R, , B(R, )) 

W.(Q, ©’) = %, (Q, 2’; R) (nonnegative simple functions) 


316 
316 
316 
317 
317 
317 
317 
aly 


317-318 


319 
319 
320 
321 
321 
321 
321 
336 
343 
343 
343 
344 
347 
350 
350 
350 
350 
354 


LIST OF SYMBOLS 


(VY, F+) (semi-4-space) 

W(Q, 3’) = W(Q, 2’; R) (simple functions) 

(W,(Q, &), rp) (extended nonnegative functions with limits) 
f* =sup{f, 0} 

f> = —inf{f,0) 

L1(Q, &, ps; R)-space 

(Q, ©, ps; R)-space 

C; ‘({a, b], B([a, b]); R) (real-valued bounded functions) 
L(f,P) (lower Darboux sum) 

U(f,P) (upper Darboux sum) 

S((a, b], B({a, b]), R) (space of all step functions on [a, b]) 


(R) { f(«)dax (Riemann integral) 
[a,b] 


L) f f(x)dx = f f(x)A(dx) (Lebesgue integral) 
[a,b] [a,b] 


(L((a, b], B([a, b)), A).|| ° Il.) 

v < yu (v absolutely continuous with respect to 11) 

MO, A’) is the set of all measures on (QQ, >’) 

MS = {v € M:v < py} (measures continuous relative to /1) 
ie (mapping defined on C-!(0, Y;R,, B(R+)) 

[C2*(Q, 2), DUS, fi 

(e-¥0, ¥), = 12, 2, R,)1,,MS, f] 

=f, (reducer of Ji ) 


oT 


mn (Radon - Nikodym derivative of v relative to 1) 


{Qo 7a) = a exp{ — cm \ (normal pdf) 
js L p (orthogonal measures) 


3B, Nt (families of positive and negative sets) 
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350 
300 
ao 
309 
309 
370 
370 
399 
399 
399 
400 
401 


401 


413 
424 
427 
427 


427 
428 
428 
429 
429 


430 


430 
436 


728 LIST OF SYMBOLS 


(P, N) (Hahn decomposition or Hahn pair) 440 
[H*, H~],, (Hahn class modulo v/) 440 
vt(-),v-(-), |v] =vt +v~ (Jordan variations) 44] 
6(Q, »’) (set of all signed measures on (2, »’)) 449 
G,(Q, 2’) (set of all finite signed measures) 449 
(G,(Q, 2’), < ) (vector lattice of finite signed measures) 449 
\|v|| = |v|(Q) (norm of signed measure v/) 451 


(G,(Q, 2’), || - ||) (normed space of finite signed measures) 45] 


((G.(Q, 2’), R, +, -, <, || - |])) (Banach lattice) 453 
@(Q, X’) (complex measures over the field C) 462 
\|€|| (norm of a complex measure C(Q, »’)) 462 
Gr ={ve ee 37): v < yu} (signed continuous measures) 472 
[L(O, ©, wsR)|,,6<, f.] 472 
ae a TE (derivatives of v and its Jordan variations) 481 


C& = {€ € C(Q, ’): € < uw} (complex continuous measures) 483 


1 ; 
OZ O),.e5L 484 
x +’; (measurable rectangles) 499 
Lig = 2 Ji = 2 (x x) (product o-algebra) 499 
i=1 i= 
Bz, (a;-section of set B) 504 
[Sx P(Q2), Fa;| (ai-section of a set) 504 
& hi (product measure) 509 
pa,0f (ai-section of a function f) 513 
(II Q; , & Si, & j:) (product measure space) 517 
i=1 B= nS 


C,(x0 , 7) (closed ball in R”) 517 


LIST OF SYMBOLS 


0, [1s 2 xX. | (product map of random variables) 
i=l tl 


Pox, = Px y = P(X; (joint probability distribution) 
i=l — oe 


(TI. @ 5;, ® Px) (probability product space) 
i=l i=1 i=1 
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225 


a2) 


52] 


Index of Names and Terms 


A 

ath projection map, 44 

ath coordinate, 44 

ath factor space, 44 

A-projection map, 50 

A-cylinder with base A, 50 

a.e. property, 349 

Abelian algebraic operation, 
67 

Abelian group, 67 

Abelian monoid, 67 

Abelian ring, 68 

Abelian semigroup, 67 

absolutely continuous 
measure, 424 

absolutely continuous signed 
or complex measure, 471 

abstract algebra, 67 

accumulation point, 106, 
ty7 

additive algebraic operation, 
67 

additive inverse, 68, 74 

additive set function, 263 

Al-Khwarismi, Muhammad, 
66 

No (aleph naught), 59 

Alexander, James, 172 

Alexander subbase 
theorem, 230 

Alexandrov, Pavel, 172 

algebra, 66, 240 


J.H. Dshalalow, Foundations of Abstract Analysis, 


algebra of functions, 127 
algebra over a field, 73 
algebraic number, 63 
algebraic operation, 67 
algebraic structures, 66 
associative algebraic 
operation, 67 
associative laws of sets, 7 
at most countable set, 59 
atomic (discrete) measure, 
267 
atomic probability measure, 
267 
axiom of choice, 36 
axiom of extent, 7 


B 

Baire's category theorem, 
136 

Banach, Stefan, 93, 149, 172 

Banach space, 151 

base for a topology, 188 

base for a topology, criterion 
188, 190, 194 

base neighborhood, 175 

base (simple) rectangle in 
product topology, 213 

base (simple) rectangle, 190 

base sets, 188 

Beppo Levi's corollary, 383 

Bernoulli, Johann, 14 

Bernoulli measure, 274 
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bijective (onto and one-to- 
one) map (function), 17 
bilinear transformation, 70 
binary relation, 14 
binary relation on a set, 29 
- antisymmetric, 29 
- equivalence, 29 
- linear total order, 29 
- partial order, 29 
- reflexive, 29 
- symmetric, 29 
- transitive, 29 
binomial measure, 274 
binomial random variable, 
339 
Bolzano, Bernard, 3 
Bolzano - Weierstrass 
compactness in metric 
spaces, 139 
Bolzano - Weierstrass 
compactness in topo- 
logical spaces, 229-230 
Borel, Emile, 141, 262 
Borel - measurable bounded 
functions, 399 
Borel measurable function, 
2o1 
Borel measure, 317 
Borel measure space, 317 
Borel outer measure, 317 
Borel - Lebesgue measure, 
316 
Borel - Lebesgue measure of a 
closed ball in R”, 517 
Borel - Lebesgue - Stieltjes 
measure, 317 
Borel o-algebra generated 


INDEX 


by a cocountable 
topology, 253 
Borel o-algebra, 247 
boundary of a set, 107, 177 
boundary point, 107 
boundary point of a set, 177 
bounded function, 134 
bounded sequence, 112 
bounded set, 34 
bounded (d-bounded) set, 
122 
Bourbaki, Nicolas, 171 
box rectangle, 215 
box topology, 215 
branch of a function, 16 


c 

C~'-class of functions, 336 

canonic chain of partitions, 
400 

canonic expansion 
(representation), 355 

canonic representation 
(expansion), 355 

Cantor, Georg, 3, 36, 58, 62, 
172 

Cantor ternary set, 327-328 

Cantor's theorem, 61 

Cantor's theorem on 
intersection of closed 
sets, 133 

Carathéodory, Constantin, 
240, 285 

Carathéodory's extension, 
uniqueness of, 297, 298 

Carathéodory's extension of 
a formatter, 284, 293 


INDEX 


Carathéodory's extension 
procedure, 283 

Carathéodory's extension 
theorem, 285 

cardinal number, 58 

cardinality, 58 

carrier, 5 

Cartan, Henri, 172 

Cartesian (direct) product, 
14 

Cartesian product of an 
indexed family of sets, 43 

Cartesian product of a 
sequence, 42 

Cauchy, Augustin-Louis, 
399, 498 

Cauchy continuity of a 
function, 125 

Cauchy integrable function, 
402 

Cauchy sequence, 112 

Cauchy sum, 402 

Cech, Eduard, 216 

chain, 34 

chain rule, 487 

change of variables for a 
bijective transformation, 
418 

change of variables formula, 
416 

change of variables formula 
with respect to the affine 
transformation, 419 

characterization of Jordan 
variations, 444 

Chevalley, Claude, 172 

choice function, 36 
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class of complex-valued 
measurable functions, 
343 

class of real-valued mea- 
surable functions, 343 

class of extended real-valued 
measurable functions, 
344 

closed ball, 108 

closed ball in R”, 517 

closed regular Borel set, 330 

closed regular Borel measure, 
330 

closed set, 174 

closed set, 106 

closure of a set, 106 

closure of a set, 176 

closure point for a set, 176 

closure point, 106 

cluster (accumulation) point, 
177 

coarser topology, 174 

cocountable (countable 
complement) topology, 
178 

codomain, 15 

cofinite (finite complement) 
topology, 182 

Cohen, Paul, 61 

commutative algebraic 
operation, 67 

commutative laws of sets, 7 

commutative ring, 68 

compact metric space, 138 

compact set in topological 
spaces, 228 

compact set under a 
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continuous function 
(metric space), 143 

compact set under a 
continuous map in a 
topological space, 228 

compact set, 138 

compact sets in Hausdorff 
spaces, 229 

compact topological space, 
228 

compactness criteria (metric 
space), 139 

compactness criterion 
(metric space), 140, 145 

comparable elements, 29 

comparison of sets, 58 

complement, 7 

complement of set B with 
respect to set A, 6 

complete Carathéodory's 
extension of a formatter, 
284, 293 

complete measure space, 
290 

complete measure, 283 

complete metric space, 112 

completeness criteria, 132 

completion of a measure 
space, 290 

completion of a measure, 
290 

complex measure space, 458 

complex measure, 458 

complex vector space, 73 

composition of binary 
relations, 18 

composition of continuous 


INDEX 


functions, 204 
composition of maps, 18 
composition of measurable 

functions, 257 
conditional expectation 

given a sub-algebra, 489 
conditional probability of an 

event given a sub- 

algebra, 490 
congruence modulo , 3 
congruence, 3 
conjugate exponents, 97 
consistent extension of a 

map (function), 18 
consistent function, 16 
content, 264 
continuity at a point, 117 
continuity criteria in metric 

spaces 119, 120 
continuity from above at the 

empty set, 264, 435 
continuity from above of a 

premeasure, criterion of, 

212 
continuity from above on a 

o-algebra, 264 
continuity from above on a 

sequence of sets, 264 
continuity from below of a 

premeasure, criterion of, 

274 
continuity from below ona 

o-algebra, 263 
continuity from below ona 

sequence of sets, 263 
continuity of a function, 

criterion of, 203, 204, 


INDEX 


205, 206 
continuity of projection 
maps, 206, 215, 224 
continuous from above set 
functions, 435 
continuous from below set 
functions, 435 
continuous function on a 
dense set, 208, 210 
continuous function on a set, 
Ny 
continuous function, 203 
continuous measure, 424 
continuum hypothesis, 61 
continuum, 59 
convergence in mean, 384 
convergence to a point, 178 
convergent sequence, 112 
coordinate of a function, 44 
Coulomb, Jean, 172 
countable (denumerable) set, 
59 
countable complement 
topology, 178 
countably compact 
topological space, 234 
counting measure space, 268 
counting measure, 268 
cover, 138 
criterion of independence of 
random variables, 527 
criterion of Lebesgue 
integrability, 401 


D 
d-bounded set, 122 
d-bounded function, 134 
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ds .-bound of a function, 136 

Darboux, Jean-Gaston, 399 

De Morgan, Augustus, 66 

De Morgan's laws, 8 

decomposition of a set, 9 

Delsarte, Jean, 172 

Denjoy, Arnaud, 406 

dense set, 114 

dense set, 177 

density, 422 

denumerable (countable) set, 
59 

Descartes, René , 42 

derived set, 107, 177 

diagonal, 200 

diameter of a set, 122 

difference, 7 

dimension of vector space, 
87 

Dieudonné, Jean, 172 

Dirac measure (point mass), 
265 

direct Riemann integral, 408 

direct Riemann integrability, 
408 

Dirichlet, Peter Gustav, 14, 
141 

Dirichlet function, 121, 364, 
403 

Dirichlet's criterion, 411 

discrete (atomic) measure, 
267 

discrete convolution, 71 

discrete metric, 95 

discrete topology, 173 

disjoint family of sets, 9 

disjoint sets, 6 
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distance, 94 

distance between two sets, 
130 

distribution function, 265, 
320 

distributive laws of sets, 8 

domain, 15 

Dynkin system, 240 


E 

Ehresmann, Charles, 172 

Eilenberg, Samuel, 172 

element of a set, 3 

elementary content, 264 

elementary event, 5 

embedded set, 209 

embedding, 209 

empty set, 4 

equipotent sets, 58 

equivalence class modulo 
= (E), 30 

equivalence kernel of a 
function, 31 

equivalent classes generated 
by a function, 31 

equivalent metrics, 121 

equivalent (similar) norms, 
164 

Euclidean distance, 98 

Euclidean metric, 98 

Euclidean norm, 151 

Euler, Leonard, 14, 499 

event, 5 

expectation of a function of 
a random variable, 419 

expectation of a random 
variable, 371, 418 


INDEX 


extended distribution 
function, 265, 320 

extended real line, 343 

extended real line topology, 
Nise) 

extended topology of point- 
wise convergence, 347 

extendibility conditions for 
the formatter, 284 

extendibility of a formatter, 
criterion of, 294 

extendible formatter, 284, 
Pie ie 

extension of the formatter to 
a measurable space, 284 

extension of a map 
(function), 18 


F 

Fy, set, 184 

factor (quotient) set, 30 

factor space, 43 

Fatou, Pierre, 361 

Fatou's lemma for 
functions, 384 

Fatou's lemma for measures 
and functions, 393 

Fatou's lemma for measures 
and nonnegative 
functions, 393 

Fatou's lemma for 
measures, 388 

field, 72, 240 

figure, 318 

finer topology, 174 

finite Borel - Lebesgue- 
Stieltjes measures, 321 


INDEX 


finite complement topology, 
182 

finite dimensional vector 
space, 87 

finite set, 59 

finite (v-finite) set, 435 

finite set function, 264 

finite signed measure, 435 

finite subadditivity, 270 

finitely additive set function, 
263 

first axiom of countability, 
179 

first countable topological 
space, 179 

formatter of the outer 
measure, 284, 292 

Fourier, Joseph, 14 

Frankel, Abraham, 36 

Fréchet, Maurice, 93, 149, 
171, 426 

Fubini, Guido, 361, 498 

Fubini's theorem, 515 

function, 14 

function continuous at a 
point, 203 

function continuous at a 
point, criterion of, 203 

function f,., 16 

function f,., 17 

fundamental system of 
neighborhoods at a point, 
175 


G 
Gs set, 184 
gamma function, 396 
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gauge function, 406 
gauge integral, 406 
generator, 247 
geometric measure, 448 
Grassman, Hermann, 67 
greatest lower bound 
infimum, 34 
Gregory, Duncan 66, 
group, 67 
automorphism, 69 
endomorphism, 69 
homomorphism, 68 
isomorphism, 68 


H 

Hahn class modulo, 440 

Hahn decomposition, 439 

Hahn decomposition 
theorem, 439 

half-open interval 
(rectangle), 242 

Hamel basis for a vector 
space, 85 

Hankel, Hermann, 67 

Hausdorff, Felix, 93, 171 

Hausdorff space, criterion 
of, 200 

Hausdorff (separated, T>) 
topological space 199 

Heine, Heinrich Eduard, 
120, 141 

Heine - Borel property, 161 

Heine - Borel theorem, 141 

Heine's continuity criteria in 
metric spaces, 120 

Helly, Eduard, 149 

Henstock, Ralph, 406 
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Henstock - Kurzweil 
integral, 406 
hereditary property in 
topological spaces, 226 
‘Hernandez - Lerma, 
Onésimo, 388 
Hilbert, David, 62 
Hobson, Ernest, 498 
homeomorphic topological 
spaces, 209 
homeomorphism, 209 
homeomorphisms and Borel 
o-algebras, relationship 
between, 259 
homothetic function, 336 
homothetic metric, 149 
Hopf, Heinz, 172 
Hdlder's inequality, 98 
hypercontinuum, 62 


I 
ideal, 278 
idempotence, 8 
of complement, 8 
of intersection, 8 
of union, 8 
identity function, 18 
identity element, 68 
image measure, 336 
image of a point, 15 
image of a set under a 
function, 16 
improper Riemann integral, 
406 
inconsistent function, 16 
indefinite integral, 422 
independence criterion of 


INDEX 


numeric random 
variables, 352 
independence criteria of 
random variables, 341 
independent Dynkin systems 
of events, 280 
independent families of 
events, 280 
independent random 
variables, 339 
independent o-algebras of 
events, 280 
indicator function, 18 
indiscrete topology, 173 
infimum, 34 
infinite dimensional vector 
space, 87 
infinite set, 59 
infinite signed measure, 435 
injective map (function), 17 
inner regular Borel set, 330 
inner regular measure, 330 
integrability of a complex- 
valued function, 462 
integral as a linear 
functional, 371 
integral of a complex-valued 
function with respect to a 
complex measure, 464 
integral of a complex-valued 
functions with respect to 
a positive measure, 462 
integral of a nonnegative 
simple function, 362 
integral of an extended 
nonnegative function, 
365 


INDEX 


integral of an extended real- 
valued function with 
respect to a complex 
measure, 463 

integral of an extended real- 
valued function with 
respect to a signed 
measure, 463 

integral of an extended real- 
valued function, 369 
integral with respect to 
Dirac point mass, 366 

integration by parts formula 
for Lebesgue - Stieltjes 
integrals, 524, 531 

interior of a set, 105, 175 

interior point, 105, 175 

intersection-stable system, 
241 

intersection, 6 

intersection-stable family of 
sets, 172 

into map (function), 17 

inverse, 68 

inverse image of function /,, 
IZ 

inverse of function , 17 

inverse of function f,, 17 

inverse of function f,,, 17 

invertible map (function), 17 

isolated closure point, 106 

isometric metric spaces, 338 

isometry, 338 

isometry between two metric 
spaces, 124 

isomorphic vector spaces, 87 

isomorphism, 87 
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J 

join operation, 35 

joint probability distribution 
of random variables, 527 

Jordan, Camille, 261 

Jordan decomposition of a 
complex measure, 458 
signed measure, 442 

Jordan variations, 441 


kK 

kernel, 69 

Klein, Felix, 67 

Kolmogorov, Andrey, 262, 
426 

Kronecker. Leopold, 62 

Kuratowski, Kazimierz, 172 

Kurzweil, Jaroslav, 406 


L 

I' space, 71, 96 

L' as a vector space, 371 

L' as a vector lattice, 378 

I space, 158 

1?-norm, 151 

I-V-Space, 347 

L'-seminorm, 413 

largest element, 34 

Lasserre, Jean, 388 

lattice, 35 

law of double expectation, 
490 

laws of algebra of sets, 7 

least upper bound 

supremum, 34 

Lebesgue, Henri, 141, 262, 
361, 498, 499 
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Lebesgue content, 313 

Lebesgue elementary 
content, 265 

Lebesgue integral, 370 

Lebesgue measure, 316 

Lebesgue measure space, 
316 

Lebesgue outer measure, 
316 

Lebesgue o-algebra, 316 

Lebesgue space, 370 

Lebesgue's dominated 

convergence theorem 
for functions, 385, 388 
for measures, 389 
for measures and 

functions, 394 
for complex-valued 
functions, 466 

Lebesgue's theorem for 
Riemann integrability, 
403 

Lebesgue - Stieltjes content, 
318 

Lebesgue - Stieltjes elemen- 
tary content, 266, 317 

Lebesgue - Stieltjes integral, 
370 

Lebesgue - Stieltjes measure, 
317 

Lebesgue - Stieltjes o-al- 
gebra, 317 

left distributive law, 68 

Leibnitz, Gottfried von, 14 

Levi, Beppo, 382, 498 

Lie, Marius, 67 

limit inferior of a sequence 


INDEX 


of sets, 10 

limit of a sequence, 10 

limit point of a sequence, 
112, 178 

limit superior of a sequence 
of sets, 10 

Lindel6f, Ernst Leonard, 139 

Lindeléf compact set in 
topological spaces, 228 

Lindel6f set, 139 

Lindeldf space, 139 

Lindeldf theorem, 139 

Lindeldéf topological space, 
228 

linear (vector) space over a 
field, 73 

linear combination, 85 

linear functional, 87 

linear hull (span), 85 

linear independence, 85 

linear manifold, 73 

linear map, 87 

linear operator, 87 

linear span (hull), 85 

linearity of the conditional 
expectation, 493 

linearly dependent finite set, 
85 

linearly disjoint manifolds, 
90 

Liouville, Joseph, 67 

Lipschitz condition, 125 

Lipschitz constant, 125 

Listing, Johann, 172 

lower Baire functions, 402 

lower bound, 34 

lower Darboux integral, 400 


INDEX 


lower Darboux sum, 399 
lower limit topology, 195 


M 
Mandelbrojt, Szolem, 172 
map, 15 
mapping, 15 
marginal sequences, 197 
maximal element, 34 
maximal linearly 
independent set, 89 
measurability of a complex- 
valued function, 462 
measurability of an extended 
real-valued function, 
criteria of, 345 
measurability of a function, 
criterion of, 256 
measurable 
function, 255 
rectangle, 499 
sets, 241 
space, 241 
subset, 5 
measure, 264 
measure generated by an 
integral, 422 
measure of a hyperplane, 
jal 
measure space, 264 
meet operation, 35 
mesh of a partition, 400 
metric, 94 
metric space, 94 
metric topology, 173, 174 
metrizable topological 
space, 174 
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metrization, 94 
minimal element, 34 
Minkowski, Herman, 160 
Minkowski spaces, 160 
Minkowski's inequality, 98 
modulus, 77 
modulus of congruence, 31 
moment generating function, 
367 
moment generating function 
of a random variable, 523 
monoid, 67 
monotone convergence 
theorem 
for a sequence of 
nonnegative of 
functions 
on a signed measure 
space, 467 
for functions, 382 
for measures, 392 
for the conditional 
expectation, 493 
generalized, 383 
monotone nondecreasing 
sequence of sets, 9 
monotone nonincreasing 
sequence of sets, 9 
monotone system, 241 
monotone vanishing 
sequence of sets, 9 
monotonicity 
of measures, 268 
of outer measure, 283, 
285 
of the conditional 
expectation, 493 
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Moore, Eliakim, 172 

Moore plane, 227 

motion-invariant measure, 
338 

motion, 338 

j-integrable function, 370 

ji-minimal decomposition of 
a set, 270 

ji-negligible (negligible) set, 
290 

ju*-measurable set, 283, 285 

ju*-separability, 283, 285 

/ig-measure, 283 

ji-a.e. property, 349 

ji-null (null) set, 290 

multivalued function, 15 

multiplicative algebraic 
operation, 67 

multiplicative inverse, 68 


N 

N(A)-tail of a sequence, 

178 

N,,-set, 290 

N(a,¢€)-tail, 112 

negative part, 77 

negligible (ju-negligible) set, 
290 

negative (v-negative) set, 
435 

negative variation of a 
signed measure, 441 

neighborhood base at a 
point, 175 

neighborhood of a function, 
51 

neighborhood of a point that 


INDEX 


meets a set, 176 
neighborhood of a point, 175 
neighborhood system at a 

point, 175 
Nikodym, Otto, 361, 425 
non-Borel set, 333 
nonnegative simple 

functions, 354 
norm, 150 
normal density, 430 
normal random variable, 430 
normed vector space NVS, 

150 
nowhere dense set, 114, 177 
v-maximal set, 446 
v-minimal set, 446 
null (~-null) set, 290 
null space, 88 
NVS, 151 


O 

one-to-one map (function), 
17 

onto map (function), 17 

open ball with respect to the 

Euclidean metric, 102 

open ball with respect to the 
supremum metric, 102, 
103 

open ball, 101 

open cover, 138 

open cylinders, 194 

open map, 222 

open neighborhood of 
function f, 218 

open neighborhood of a 
point, 175 


INDEX 


open parallelepiped 
(rectangle), 190 

open rectangle 
(parallelepiped), 190 

open (d-open) set, 103 

open sets, 172 

open subcover, 138 

origin, 73 

orthogonal measures, 430 

orthogonal transformation, 
338 
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product measure space, 517 
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88 
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set function, 263 
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setwise convergence, 
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